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Au revoir 


As Britain leaves the European Union, a 
future research relationship must be built on 
continued collaboration — and compromise. 


tthe stroke of 11 p.m. on 31January, researchers 

will be among those lighting candles to mark 

the United Kingdom’s departure from the 

European Union. Most scientists didn’t want 

Brexit. Scientists co-founded a national 
campaign for a second referendum on the country’s EU 
membership, fearing that the split would destroy a union 
that has promoted collaboration and the advancement of 
knowledge for the past 47 years. 

They were unsuccessful — and the next 11 months willbe 
an uncertain time as the United Kingdom enters atransition 
period in which it must negotiate the terms of its future 
links with the EU. UK researchers do not yet know if they 
will be part of joint EU research programmes. However, 
in one piece of welcome news, the futures of the many EU 
researchers who want to make their lives in Britain now lie 
in the hands of the UK Research and Innovation agency, 
and not the UK Home Office — an institution that, for much 
of the past decade, enacted and implemented policies to 
reduce immigration to the United Kingdom. 

But it’s not only the act of separation that will be so 
painful. Research has always been central to European 
unity. Back in 1973, Britain joined much more than the 
‘European Economic Community’, which is what later 
became the EU. It joined an organization that had been 
founded to further anumber of principles — that itis better 
for nations in the same neighbourhood to make decisions 
together; that conflicts must be resolved through nego- 
tiation and compromise; and that it benefits everyone if 
richer countries can assist poorer ones. 

Each of these principles was rooted in the same goal — 
to avoid a repeat of the brutal war from which the EU’s 
founders had emerged. The continent’s scientists were very 
much part of the Second World War, whichis why the bloc’s 
architects deliberately planned for joint research — and 
especially technology — to be part of the glue for European 
integration. The idea that research cooperation can assist 
peace-building has inspired other regional unions, from 
Africa to Latin America and the Caribbean. 

As the EU was taking shape, some of Britain’s scientific 
leaders were sceptical about collaborating with European 
colleagues. But scepticism faded as the benefits of a 
European research area — not unlike that of the United 
States — with pooled funding and freedom of movement 
became clear. Britain’s researchers received the second 
largest share of their country’s EU funds (after farmers), and 
as of June 2019, this was more than €1 billion (US$1.1 billion) 
a year. Much of this went towards collaborative projects 
with societal impact, such as the European Social Survey, 
which tracks changing social attitudes in 38 European 
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nations; the Clean Sky programme, in which researchers 
from 27 countries work together to reduce aircraft carbon 
emissions and noise; or the Innovative Medicines Initiative, 
where researchers collaborate to develop medicines for 
under-served conditions such as tuberculosis. 


Free movement 


Mindful of what it could lose, the governmentis investigat- 
ing whether Britain could pay to join — or be ‘associated’ 
with—EU programmes. But at the same time, ministers have 
been preparing for the possibility that this will not happen. 
That’s because UK membership of EU research programmes 
will require some form of freedom of movement and, so far, 
the UK government has said freedom of movement must end 
inits current form. Joining EU programmes also depends on 
the wider terms of the United Kingdom’s future EU relation- 
ship to be thrashed out this year. Within government, there 
are varying views as to how close Britain should remain to 
the laws and regulations of its former EU partners — what is 
called ‘alignment’. But from the perspective of researchers 
and society as a whole, there are more benefits in alignment 
than in moving out of the EU’s orbit, at least for now. 

It’s in everyone’s best interests that Britain should 
continue to align, to some degree, with the continent’s 
carefully crafted decision-making frameworks on the envi- 
ronment, data, employment, food safety and the broader 
life sciences. Environmental issues are international by 
their very nature, so there is no sense, nor logic, in Britain 
creating its own standards, or aligning with a minority of 
countries. The same applies for regulations on technolo- 
gies such as those involved in gene editing and artificial 
intelligence. UK researchers must be part of global efforts. 


Regional aid 


The United Kingdom should also consider continuing to 
contribute to what is called ‘cohesion funding’. These are 
EU funds that goto poorer regions, and include funding for 
research. Between 2014 and 2020, Britain’s less developed 
regions — including those in Wales, Cornwall and the north- 
east of England — received €10.6 billionin cohesion funding. 
Under EU rules, this contribution must be matched from 
national sources, doubling the available money. 

The government has pledged to replace some cohesion 
funding through a ‘Shared Prosperity Fund’, and ona 
visit to northeast England last week, science minister 
Chris Skidmore promised to “level up” research funding 
between the UK regions. Such an announcement could not 
have come sooner. The government’s own data show that, 
in 2017-18, the northeast and Wales each received 2% of 
Britain’s gross expenditure on research and development. 
By contrast, 35% went to institutions in London and the 
southeast of England. 

But at a conference organized by the Institute for Public 
Policy Research in Newcastle upon Tyne, UK, on 22 Jan- 
uary, it was pointed out that the various regions will be 
required to compete with each other for this new funding 
— creating winners and losers. And there were fears that UK 
funding for the poorest communities would be less than 
what has been obtained through EU funding. “Nothing I’ve 
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heard says the government knows what to do to solve 
the problems of the North of England,” said political 
scientist Anand Menon, director of the UK ina Changing 
Europe research programme. 

The government needs to listen to these warnings if it 
is to make good on its promises. As the prime minister 
and his senior advisers start to make decisions in the 
coming months, leaders of the research community 
must continue to use their influential voice and ensure 
that UK researchers are generously funded, that under- 
served communities do not lose out and that research 
policies continue to reflect an international consensus. 


Dahrendorf’s wish 


“The European Union is seen by many as a model. And 
Britain must be part of that model.” The German-British 
philosopher Ralf Dahrendorf wrote these words in 
frustration in 1995 when a previous Conservative 
UK government found itself in turmoil over Britain’s 
relationship with the EU. 

Dahrendorf was unusually placed in that he could see 
Britain’s relationship with the EU from many sides. He 
could see how EU membership was being interpreted 
by some as a loss of sovereignty in a post-imperial 
nation. But as a refugee from Nazi Germany who had 
been welcomed by Britain, he also understood the EU’s 
crucial importance as the backstop against a return to 
authoritarianism in continental Europe. 

Dahrendorf’s wish will be denied. But it is essential 
that both the United Kingdom and the remaining 
27 EU member states and EU institutions do not let 
Brexit diminish that mission. The EU exists to protect 
democracy and the rule of law. It ensures continued 
peace and prosperity through negotiation and compro- 
mise, through the freedom to travel and trade, and by the 
strong helping the weak — all catalysed by knowledge, 
research and innovation. These are values to which every 
nation must aspire, including the United Kingdom even 
if it is no longer part of the EU. 


What you want 
Natureto do next 


We asked readers what we should 
focus on in the next decade. Here is what 
you said. 


ate last year, as Nature marked its 150th 
anniversary, we spent time reflecting on our 
values and howwe could improve. We were keen to 
hear from you, our readers, so we put upa survey 
that asked: “What activity do you think is most 
important for Nature to focus on over the next decade?” 
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We're 
delighted to 
see SO many 
readers urge 
us to focus 
on global 
challenges. 
You're right.” 


More than 500 of you responded (thank you!) and four 
things stood out. You would like Nature to make papers 
and data easier to access; help readers to find and digest 
research more easily; work to improve research integrity; 
and publish and communicate research that addresses 
global challenges. These responses and the many 
additional comments you sent will guide us as we chart a 
course into the new decade. 

Some readers urged us to improve accessibility in its 
broadest sense. We should make research “easier for very 
curious but non specialized people to read”, wrote one. We 
strive to ensure that abstracts to research articles and all 
our news and opinion content are clear and engaging for 
readers from different fields and at different career stages. 
But it’s important to be reminded, as another reader told 
us, that palaeontology should be accessible to a neuro- 
scientist and vice versa. It’s equally important, as a reader 
from Mexico said, that we maintain “truthfulness and 
impartiality in the global dissemination of science’. 

We're pleased that many readers complemented Nature's 
daily Briefing, our round-up of essential reading in research 
news from across the world. We want to do more to help 
readers make sense of what can be an overwhelming 
volume of information about new research. 

Respondents also urged us to do more to make science 
open by further embracing open access, open data and 
reproducibility — including publishing peer-reviewed work 
that reproduces the results of previous studies. These views 
are informing our direction. And quite a few readers asked 
why Nature covers policy and politics in science; we see this 
as anessential part of what we do because policy decisions 
affect research and researchers’ lives, and because the out- 
come of research affects policy decisions, too. 

We're delighted to see so many readers urge us to focus 
on global challenges. You're right. Nature is committed 
to publishing and reporting in the areas that fall under 
the United Nations Sustainable Development Goals, and 
2020 will bea busy year. In October, countries that belong 
to the UN Convention on Biological Diversity will meet 
in Kunming, China, to update global goals to reduce bio- 
diversity loss. The following month, world leaders will 
gather in Glasgow, UK, to agree on a new and hopefully 
more ambitious set of climate targets. Before that, inJune, 
representatives of many nations will convene in Lisbon with 
scientists, businesses and campaign groups for the UN 
Oceans Conference. We'll be covering all of these events. 

You also told us in our poll that we should focus on 
increasing the diversity of our authors and contributors. 
We are redoubling our efforts here — and also reiterate 
our commitment from last December to having no more 
male-only speaker panels and organizing committees for 
Nature events. At the same time, we fully recognize that 
there is much more that we need to do. 

Science, as one reader reminds us, is an essential part of 
humanity’s heritage. “I hope that Nature can keep improv- 
ing, and make [an] effort to preserve and select important 
information in a way that future generations can access 
them easily.” 

We very much agree. 
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GEORGINA FERRY 


A personal take on science and society 


World view 


By Georgina 
Ferry 


Remember what science 
owes to child refugees 


Callous policies in Brexit’s wake will ill serve 
anation that claims to cherish innovators — 
learn from 1930s Britain, says Georgina Ferry. 


hen the British House of Commons voted 

this month not to uphold child refugees’ 

right to join family members inthe United 

Kingdom, I was reminded of something. 

As ascientific biographer and obituarist 
(for Nature and the newspaper The Guardian), |'m regularly 
struck by how much leading scientific nations have gained 
from people to whom they once gave sanctuary. 

Every life — whether of an adult or a child — is valuable, 
and people should be saved for humanitarian reasons alone. 
But it’s worth remembering that the country benefits. 

Take three pioneering researchers whoall died in Decem- 
ber 2019, and all had one other thing in common: they came 
to Britain in 1938-39 as unaccompanied child refugees 
from Europe. Hans Kornberg elucidated the reactions that 
microbes use to exploit carbon sources; Hannah Steinberg 
identified some of the effects of psychoactive drugs and 
drug combinations on behaviour; and Leslie Brent rec- 
ognized a mechanism underpinning immune tolerance. 

Current British policy on child refugees is callous. There 
are perhaps a few hundred such children waiting to join sib- 
lings, uncles, aunts or grandparents inthe United Kingdom. 
Most have either lost their parents or left them behind in 
their war-ravaged countries. 

Under European Union law they currently have the right 
to rejoin their families, but this is set to be revoked now 
that the Withdrawal Agreement Bill that sets terms for Brit- 
ain’s relationship with the EU after Brexit on 31 January has 
passed its final stage. After winning a convincing electoral 
victory in December, Conservative lawmakers ditched akey 
amendment guaranteeing the rights of child refugees; an 
attempt to reinstate it earlier this month failed. 

Contrast this with the action of the British government 
in 1938. In November that year, on what has become known 
as Kristallnacht, Jewish homes, businesses and synagogues 
were ransacked across Germany with the connivance of the 
Nazi government. Days later, the UK House of Commons 
debated the situation, and the home secretary agreed 
to admit an unspecified number of Jewish children from 
Germany and neighbouring countries through a system 
later known as the Kindertransport. Most of the children 
travelled by train from Vienna, Berlin or Prague. In the 
months leading up to the outbreak of war in September 
1939, around 10,000 came to the United Kingdom. 

Steinberg fled Vienna aged 14, in 1938. She stayed with 
relatives and then a foster family; her mother took her own 
life and her father fled to Israel. In Britain, Steinberg was 
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able to study psychology at University College London, 
where she went on to hold one of the world’s first chairs 
in psychopharmacology. In retirement, she continued 
her research, helping to establish an evidence base onthe 
psychological effects of exercise. 

Kornberg, later knighted for his services to science, 
arrived in 1939 from northern Germany. Aged 11, he went 
to live with an uncle; both his parents died in the Holocaust. 
On leaving school, he took ajob as a technician with Hans 
Krebs, aJewish biochemist fired in 1933 by the University 
of Freiburg, Germany, because of his heritage. Krebs had 
just discovered the cycle of energy transformation in cells, 
for which he later won a Nobel prize. Kornberg went onto 
reach heights of distinction he could not have dreamt of on 
the cold, crowded train from Berlin, including gaining the 
chair in biochemistry at the University of Cambridge, UK. 

Brent (born Lothar Baruch) was a key contributor to 
Peter Medawar’s Nobel-prizewinning work on induced 
immunetolerance. At age 13, Brent was on the first Kinder- 
transport from Berlin. Both his parents were transported 
to Latvia and shot. He came to the United Kingdom without 
any family to greet him, and worked as a laboratory assis- 
tant before achieving academic success. 

Others who endured appalling upheavals in their child- 
hoods and yet thrived after the Kindertransport are still 
alive, most notably Steve Shirley, who shattered the glass 
ceiling for women in computing and pioneered home- 
based programming. Made a dame for services to infor- 
mation technology in 2000, she says: “I determined at a 
ridiculously young age to make mine a life that had been 
worth saving.” Others moved to the United States, of whom 
three — Walter Kohn, Arno Penzias and Jack Steinberger — 
won Nobel prizes in chemistry or physics. All would prob- 
ably have perished but for a political decision. 

In 1938, Britain had only recently emerged froma reces- 
sion. It was heading into a war, and national debt remained 
stubbornly high. Yes, more could have been done, but for 
child refugees common humanity prevailed. It should doso 
again, in Britain and beyond. (As an aside: there is abundant 
evidence that immigration benefits economies, and that 
people overestimate true levels of migration by orders 
of magnitude. The argument that wealthy nations must 
safeguard their economies from becoming ‘overwhelmed’ 
by ‘uncontrolled’ immigration has no evidence base.) 

Without a legal means to access the United Kingdom, 
children will risk their lives in the unseaworthy craft that 
are increasingly trying to cross the busy and often wind- 
tossed English Channel, or stow away inthe kind of lorry in 
which 39 Vietnamese people suffocated in October 2019. 

Out of simple human compassion — and in honour of the 
huge contributions made by refugee researchers — should 
not scientists speak up for these children? 
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The world this week 


Newsin brief 


SOFTWARE TRACKS RIGOUR OF 
SCIENTIFIC PAPERS OVER TIME 


Researchers are getting better 
at communicating science in 

a rigorous and reproducible 
way, according to a text-mining 
analysis of around 1.6 million 
papers. But the findings have 
also sparked fears that progress 
is too slow. 

The study used software 
called SciScore, which gives 
papers a mark out of ten for 
‘rigour and transparency’ 

(J. Menke et al. Preprint on 
bioRxiv http://doi.org/dkg6; 
2020). SciScore searches 

the text in papers’ methods 
sections for around 20 pieces 

of key information, which act 

as proxies for how rigorous the 
experiments are, and how easy it 
would be for other researchers 
to reproduce them. The 
software can flag where authors 
have specifically identified the 
reagents and tools they use, 
such as antibodies, software, cell 
lines or transgenic organisms. 

It also checks whether they 

have discussed factors such as 
sample sizes, how tests have 
been blinded or the sex of 
animals used. 

The researchers who 
created SciScore — led by Anita 
Bandrowski, an information 


scientist at the University of 
California, San Diego — analysed 
1.58 million freely available 
life-sciences papers indexed in 
the PubMed Central database. 
They found that between 1997 
and 2019, the average score 
across all papers more than 
doubled, from 2 out of 10 to 4.2. 
The analysis also showed that 
individual measures of rigour 
are onthe rise. For example, less 
than 10% of papers published in 
1997 discussed randomization 
inthe methods; this had risen 

to around 30% in 2019 (see 
‘Rigorous research’). 

But the numbers overall 
haven't increased as much as 
some researchers would like. By 
calculating the average SciScore 
rating for all the papersina 
given journal, Bandrowski and 
her colleagues created a metric 
they dubbed the Rigor and 
Transparency Index. Although 
the study finds that all journals’ 
average scores have increased 
since 1997, no title among 
those analysed has an index of 
more than five out of ten. This 
suggests that “less than half of 
the rigor and reproducibility 
criteria are routinely addressed 
by authors’, the study says. 


RIGOROUS RESEARCH 


Scientists are increasingly publishing details about 
the rigour and reproducibility of their experiments. 
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ANCIENT AFRICAN 
GENOMES OFFER 
GLIMPSE INTO EARLY 
HUMAN HISTORY 


Researchers have sequenced the 
genomes of four children who 
lived in what is now Cameroon 
several thousand years ago. 

Their genomes — the first to 
be collected from any ancient 
human in West Africa — raise 
questions about the origins of 
a migration called the Bantu 
expansion, which carried 
languages and agriculture 
across the continent around 
3,000-5,000 years ago. They 
also hint at older events in 
human history, such as the 
emergence of Homo sapiens and 
its spread out of Africa. 

David Reich, a population 
geneticist at Harvard 
Medical School in Boston, 
Massachusetts, and Mary 
Prendergast, an archaeologist 
at Saint Louis University — 
Madrid Campus, analysed 
remains froma rock shelter in 
Cameroon called Shum Laka. 
They generated full genomes 
for two young boys, who lived 
8,000 and 3,000 years ago, and 
collected more limited genome 
data froma boy anda girl from 
the same periods, respectively. 

A genetic analysis, published 
on 22 January (M. Lipson 
etal. Nature http://doi.org/ 
dkh4; 2020), showed that 
all four children descended 
froma group of Homo sapiens 
that branched off from the 
common ancestors of our 
species more than 200,000 
years ago. 
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Nations 
appeals for 
locust help 


Sign up to get essential science 
news, opinion and analysis 
delivered to your inbox daily. 
Visit go.nature.com/newsletter 


Hundreds of millions of desert locusts, in swarms 
larger than cities, are ravaging East Africa. The Food 
and Agriculture Organization of the United Nations 
and anumber of aid agencies are appealing for urgent 
help to deal with the crisis. Kenya has been worst hit — it 
has not seen locusts on this scale for 70 years — but the 
infestation has also struck Ethiopia (pictured) along 
with Somalia. Desert locusts (Schistocerca gregaria) 
have been breeding in large numbers because of 
unusual weather patterns, including heavy rains. The 
UN says that they need to be controlled by pesticides 
dropped from the air. If left unchecked, the locust 
population could grow 500-fold by June. A swarm the 
size of Paris will eat the same amount of food ina day as 
half the population of France. 


US OFFICIALS 
REVISIT RULES FOR 
RISKY DISEASE 
EXPERIMENTS 


US disease researchers are 
pushing the government to 

be more transparent about 
federally funded research that 
involves making pathogens 
more deadly or more 
transmissible. 

Several scientists who 
attended a meeting of the 
National Science Advisory 
Board for Biosecurity (NSABB) 
on 23-24 January say that the 
US government should offer 
a public explanation when 
it approves such ‘gain-of- 
function’ experiments, which 
are designed to help improve 
responses to outbreaks. The 
scientists also call for the 
government to disclose who 
decided to fund such research, 
and make a broad public 
announcement whena study 
begins. Others say that greater 
transparency could make it 
harder to approve necessary 
research. 

The NSABB is reviewing 
guidelines for sharing 
information on gain-of-function 
research at the request of the 
National Institutes of Health 
(NIH) and the White House. The 
board is an independent panel 
that advises the NIH’s parent, 
the Department of Health and 
Human Services. 

The debate over how much to 
disclose is revving up because 
the government is preparing to 
revisit rules that guide gain-of- 
function research — especially 
with regard to communication 
with the public. 

“We're not trying to say the 
policy is wrong — we’re trying 
to say the policy is ambiguous,” 
says Marc Lipsitch, an 
epidemiologist at the Harvard 
T.H. Chan School of Public 
Health in Boston, Massachusetts, 
and one of the researchers 
calling for more transparency. 


fs yo 3 ae ee 
ABORIGINAL SITES 

PROBABLY DAMAGED 
IN AUSTRALIAN FIRES 


Indigenous communities and 
archaeologists fear thousands 
of historic Aboriginal sites and 
artefacts have been damaged by 
fires that have ravaged Australia. 

Since September, fires have 
razed more than 10 million 
hectares, mostly in the eastern 
states of Queensland, New 
South Wales and Victoria. 

Much of that land isin 
national parks and other forests, 
where tens of thousands of 
important Indigenous sites 
are found, including many 
that have not been officially 
recorded, says Tiina Manne, an 
archaeologist at the University 
of Queensland in the Gold Coast 
and president of the Australian 
Archaeological Association 
(AAA). 

These sites show where 
people lived and how they 
moved over tens of thousands 
of years, and help to reveal 
the development of cultural 
practices such rock art 
(pictured). 

Researchers have yet to 
doa formal assessment, but 
Manne says thousands of 
Aboriginal sites will have been 
affected. The AAA is calling on 
the government to conduct 
assessments of cultural sites as 
part of its fire-recovery plans. 

The fires might also have 
opened up densely vegetated 
areas, potentially revealing 
undocumented sites, says Paul 
Tacon, who studies rock art at 
Griffith University, Gold Coast. 
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The world this week 


News in focus 
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Paramedics transport a man thought to be the first person in Hong Kong to have contracted the new coronavirus. 


WHAT SCIENTISTS WANT 
TO KNOW ABOUT THE 
CORONAVIRUS OUTBREAK 


Researchers have sequenced the deadly pathogen’s genome — 
and are nowrushing to find out how it spreads and how deadly itis. 


By Ewen Callaway and David Cyranoski 


he world is racing to learn more about 
the outbreak of anew viral infection that 
was first detected in Wuhan, China, last 
month and is causing increasing alarm 
around the world. 

As Nature went to press, officials in China 
had confirmed more than 4,500 cases of the 
virus, which causes a respiratory illness, and 
some 100 deaths. Around 50 cases had also 
been confirmed in other countries, in Asia, the 
United States and Europe. 


Researchers fear similarities to the 2002-03 
epidemic of severe acute respiratory syn- 
drome (SARS), which emerged in southern 
China and killed 774 people in 37 countries. 
Both are members ofa large virus family, called 
coronaviruses, that also includes viruses 
responsible for the common cold. 

China has taken unprecedented actiontotry 
to halt the outbreak — including putting Wuhan 
and nearby cities on ‘lockdown’, restricting 
travel in and out of the cities. For now, the 
World Health Organization (WHO) has held 
off declaring a public-health emergency of 
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international concern — the agency's highest 
level of alarm — after a meeting of officials last 
week, but that could change. 

Nature rounds up the questions at the heart 
of scientists’ efforts to understand the virus. 


How does the virus spread? 

This is the most urgent question surrounding 
the outbreak. Chinese authorities have con- 
firmed that it spreads from person to person 
after identifying clusters of cases among fam- 
ilies, as well as transmission from patients to 
health-care workers. Monitoring the rate at 
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which newcases appear, and when symptoms 
began for each case, should reveal how easily 
the virus passes between humans and whether 
the outbreak has the potential to persist. 

One figure that epidemiologists want is the 
number of people that someone with the virus 
tendstoinfect —knownas Rp. An, of morethan 
1means that countermeasures, suchas quaran- 
tine, willbe needed to contain the spread. The 
WHO last week published an estimated R, of 
1.4-2.5. Other teams suggest slightly higher 
values!”. These estimates are similar to the Ry 
of SARS during the early stages of the 2002-03 
outbreak, and of the novel strain of H1N1 influ- 
enza that caused a pandemic in 2009. But they 
are higher than R, values estimated during out- 
breaks of the Middle East respiratory syndrome 
(MERS) virus, a coronavirus similar to SARS. 

“Now it’s in the range of these other impor- 
tant epidemics, and that indicates the potential 
that it will cause a similar scale of public-health 
concern if nothing else happens,” says 
Mark Woolhouse, an epidemiologist at the 
University of Edinburgh, UK. 

But researchers caution that Ry estimates 
come with large uncertainties because of gaps 
in the data, and the assumptions used to cal- 
culate the figure. They also point out that the 
Ryisamoving target and that estimates of the 
figure change over the course of an outbreak — 
as control measures are implemented. In the 
coming days, health authorities and research- 
ers will be looking for signs that the travel 
restrictions in Wuhan and other Chinese cities, 
and other steps taken by to stem transmission, 
have reduced the R, there. 


Can infected people spread the 
virus without showing symptoms? 
Another major unanswered question is 
whether — and how extensively — people 


without symptoms can infect others. A study’ 
of a cluster of six infections in a family in 
Shenzhen identified a child who was infected 
with the virus but showed no symptoms. If 
such asymptomatic cases are common and 
these individuals can spread the virus, then 
containing its spread will be much more dif- 
ficult, researchers say. Key to controlling the 
SARS virus was the fact that few cases were 
asymptomatic. 

“Defining the scale of asymptomatic trans- 
mission remains key: if this is arare event then 
its impact should be minimal in terms of the 
overall outbreak,” Jonathan Ball, a virologist 
at the University of Nottingham, UK, saidina 
statement distributed by the UK Science Media 


“Defining the scale of 
asymptomatic transmission 
remains key.” 


Centre. “But, if this transmission mode is con- 
tributing significantly then control becomes 
increasingly difficult.” 

One way to determine whether symp- 
tom-free people can spread the virus would 
be to study its spread in individual households 
in China, says Sheila Bird, a biostatistician at 
the University of Cambridge, UK. By monitor- 
ing all the members of a household in which 
one person is infected, it should be possible to 
determine whoelse contracts the virus and how. 
Such studies would also be helpful for identi- 
fying ways of stopping spread in households, 
Bird adds. 

Raina MacIntyre, an epidemiologist at the 
University of New South Wales in Sydney, 
Australia, says that although the rise in cases 
probably reflects an increase in testing for 


Public-health staff check the temperatures of people arriving from China at Bangkok airport. 
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and detecting the virus, the dramatic jump is 
concerning. “It’s very mucha dynamic picture, 
and until we have an indication that cases 
are declining, it’s going to continue to be of 
concern,” she says. 

But MacIntyre also notes that researchers 
are struggling to accurately model the out- 
break, and to predict how it might unfold, 
because the case-report data being released 
by Chinese authorities are incomplete. “What 
we need to identify is when people got sick, not 
when the cases were reported, and all we’ve 
seen so far is when the cases were reported.” 


How deadly is the virus? 

High rates of pneumonia among the first 
people infected had many researchers wor- 
ried that the Wuhan virus was especially 
pernicious. Those concerns have receded 
slightly, as more mild cases have turned up. 
With some 100 deaths in more than 4,500 
reported cases, the virus does not seem to be 
as deadly as SARS — which killed around 10% 
of the people it infected. But “It’s too early 
to be sanguine about the severity”, says Neil 
Ferguson, a mathematical epidemiologist at 
Imperial College London. 


Where did the virus come from? 


Authorities are working onthe theory that the 
virus originated in an unidentified animal or 
animals, and spread to humansatan animal and 
seafood market in Wuhan, whichis now closed. 
Identification of the animal source of the virus 
could help officials to control the current out- 
break and gauge its threat — and potentially 
prevent future epidemics, say researchers. 

Genetic sequencing suggests that the 
Wuhan virus is related to coronaviruses that 
circulate in bats, including SARS and its close 
relatives. But other mammals can transmit 
these viruses — SARS was probably spread to 
humans by civet cats. 

The market in Wuhan also sells wild animals. 
And inacontroversial study* published last 
week, ateam of researchers in China who had 
performed a genetic analysis suggested that 
the virus jumped to humans from snakes. But 
other scientists were sceptical of the study, 
and said that there is no proof that viruses such 
as those behind the outbreak can infect spe- 
cies other than mammals and birds. “Nothing 
supports snakes being involved,” says David 
Robertson, a virologist at the University of 
Glasgow, UK. 

Many researchers think that the animal 
host or hosts of the virus won't be identified 
without further field and laboratory work. 
And many hope that genetic tests of animals 
or environmental sources will turn up clues. 

Amammalis the most likely candidate, says 
CuiJie, a virologist at the Pasteur Institute of 
Shanghai who was part of a team that identi- 
fied SARS-related viruses in bats from a cave 
in Yunnan province in southwestern China 
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The city of Wuhan, China, is on lockdown in an attempt to halt the coronavirus outbreak. 


Se 


in 2017. SARS and the new virus are part of a 
subgroup knownas betacoronaviruses. Field- 
work in the wake of the SARS outbreak has 
found such viruses only inmammals, Cui says. 


What can we learn from the virus’s 
genetic sequence? 


Genetic sequencing of the Wuhan corona- 
virus offers clues to its origins and spread. 
Labs in China and Thailand have sequenced 
the genomes of more than 20 strains found 
in infected people and have made them pub- 
licly available. That’s “pretty remarkable”, 
says Trevor Bedford, an evolutionary genet- 
icist at the Fred Hutchinson Cancer Research 
Center in Seattle, Washington, who is analys- 
ing the sequences as they come in. “People 
are extremely fast and excellent about data 
sharing,” he adds. 

Bedford and other geneticists are using the 
data to determine when the virus emerged — 
current estimates point to November 2019. 
Viral sequences, Bedford adds, could identify 
any genetic changes that might have helped the 
virus make the jump from animals to humans. 
And if there is extensive human-to-human 
transmission, Bedford and other geneticists 
will be looking for signs that the virus has 
gained further mutations that are enabling it 
to spread more efficiently in humans. 

Bedford cautions that any conclusions are 
preliminary, because so few data are availa- 
ble. “Adding a few key samples can change the 
story significantly,” he says. 


Cana drug be developed to treat 
the coronavirus? 

No drugs have been shown to be effective in 
treating SARS or other coronavirus infections 
inhumans, andnovaccines aimed at preventing 


these infections have been licensed. 

A team at China’s National Engineering 
Research Center for the Emergence Drugs 
in Beijing is working on finding therapies 
that would work by blocking the receptor on 
human cells that the virus latches on to and 
uses to infect the cells. A comparison of the 
SARS and new China virus sequences, pub- 
lished on 16 January, found that they prob- 
ably bind to the same receptor. The team is 
hoping to revive efforts to develop treatments 
for SARS and adapt them ina bid to developa 
drug that could work against the latest virus. 

Another researcher who has been develop- 
ing drugs for coronaviruses since the SARS 
outbreak is hoping to test drug candidates in 


“People are extremely 
fast and excellent about 
data sharing.’ 


animal models of the Wuhan virus (see ‘Q&A: 
Rolf Hilgenfeld’). 

Chinese authorities are also testing whether 
existing HIV drugs can treat the infection. 
Ritonavir and liponavir, which are approved to 
treat HIV, are being given to people with pneu- 
monia caused by the coronavirus, according to 
media reports and a 26 January statement by 
the Beijing branch of China’s National Health 
Commission. 


1. Read, J. M. et al. Preprint at MedRxiv https://www.medrxiv. 
org/content/10.1101/2020.01.23.20018549Vv1 (2020). 
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3. Chan, J. F.-W. et al. Lancet https://doi.org/10.1016/S0140- 
6736(20)30154-9 (2020). 

4. Ji, W., Wang, W., Zhao, X., Zai, J. & Li, X. J. Med. Virol. 
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Rolf Hilgenfeld 


Structural biologist Rolf Hilgenfeld 

has been trying to develop a cure for 
coronaviruses since the 2002-03 
outbreak of severe acute respiratory 
syndrome (SARS). Hilgenfeld, who is 
based at the University of Liibeck in 
Germany, is hoping to get into the locked- 
down city of Wuhan in China to test drug 
compounds in animals infected with the 
new coronavirus. He tells Nature about 
his quest. 


Why are you visiting China? 

After this virus emerged, | contacted 
collaborators in Wuhan. | have two 
compounds to test against the new virus, 
so lam seeking collaborators who have 
samples of the virus. 


At what stage of development are your 
compounds? 

We have been preparing them for testing in 
a mouse model of Middle East respiratory 
syndrome (MERS). In cell culture, we know 
they work against the SARS and MERS 
coronaviruses. 


Could they help to subdue the new virus? 
The problem with antiviral drugs is that 
when the compound is ready, there are no 
patients. After six months, we could have 
data showing that one of our compounds 
works against the new virus, and would be 
able to develop a drug. But if the outbreak 
is over, there will be no patients, so how 
can you do clinical trials? 


What do your compounds do? 

They are active against coronaviruses and 
a family of enteroviruses, which include 
hand, foot and mouth disease. Every year, 
half a million children get enterovirus-71, 
so we'd aim to go into clinical trials for 
these diseases. We can involve pharma. 

If we have something approved for 

those, we can use the drug in the next 
coronavirus outbreak. The compounds 
are directed at viral proteases, which have 
common features in coronaviruses and 
enteroviruses. 


Interview by David Cyranoski 
This interview has been edited for length 
and clarity. 
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Newsin focus 


Brexit is happening: 


what does it mean for science? 


News 


explainer 


At11 p.m. on 31 January, the United 
Kingdom will leave the European Union. 
Although an 11-month transition period 
means the milestone will bring few 
immediate changes for researchers, the 
moment marks the start of negotiations 
over the country’s future relationship with 
the bloc, including the United Kingdom's 
role in the EU’s next seven-year research 
programme, Horizon Europe, which will 
start on 1 January 2021 and is set to be 
worth €100 billion (US$110 billion). As 
Brexit begins, Nature takes a look at how 
2020 negotiations could shake out for 
science. 


What will change immediately after 11 p.m. 
on 31 January? 

The United Kingdom will enter a transition 
period that will freeze its relationship with 
the EU for the rest of the year. Until 2021, 
scientists will still be free to take up jobs 
and travel between the United Kingdom 

and other EU countries. British researchers 
will be able to apply for European research 
funding and take part in exchange schemes. 


Will the United Kingdom seek to join 
Horizon Europe? 

On 20 January, UK science minister Chris 
Skidmore told Parliament that he wanted 
the UK to ‘associate’ with Horizon Europe 

— a status that could allow UK scientists 

to participate on similar terms to those 

they experience today. But he stressed that 
association would depend on the final shape 
and content of the programme, which has 
yet to be agreed by European legislators. 
Wider negotiations — including agreements 
on immigration — are also likely to affect 
whether this kind of association is possible. 
Any such agreement would normally come 
only after countries have agreed a trade deal. 


What other factors will affect the United 
Kingdom's participation in Horizon Europe? 
Skidmore will have to convince the UK 
treasury that participation is worth the cost. 
The country has long taken more out of 

the EU research pot than it puts in, but that 
will almost certainly change, probably to a 
model in which the United Kingdom pays 
into the programme in line with what it 
receives. 
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An anti-Brexit protest outside the Houses of Parliament in London. 


How can the UK associate if freedom of 
movement between the United Kingdom 

and the EU ends? 

Under the existing Horizon 2020 research 
programme, Norway and Switzerland — both 
non-EU countries that wanted to associate — 
have to allow the free movement of people 
across their borders. This is something that 

the UK government has already ruled out. But 
the EU has suggested that it might be more 
flexible on the conditions of entry for the next 
programme. Beth Thompson, head of UK and 
EU Policy at the London-based biomedical- 
research funder Wellcome, says a UK system 
that guarantees mobility for researchers — such 
as the fast-track visa for researchers that the UK 
government announced on 27 January — could 
be enough to secure an agreement. The Global 
Talent visa route will open from 20 February, 
upgrading and expanding the underused 
‘exceptional talent’ visa and allowing the 
country’s main research funder, UK Research 
and Innovation, to endorse candidates. 


Can the scientific relationship between the 
United Kingdom and the EU be agreed by the 
end of the transition period on 31 December? 
It's possible, says Thompson, but it will be a 
challenge. A mock negotiation exercise carried 
out by Wellcome and the Brussels-based 
economic think tank Bruegel found that there 
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is only a “remote” chance that the United 
Kingdom and the EU will be able to agree a 
wider trade deal in 11 months. But the report, 
published on 28 January, concluded that it 
might be possible to agree an unprecedented 
standalone deal for science. For this to 
happen, both sides would need to agree to 
make an exception for science, for example 
by establishing special visas for researchers 
and setting up a system that allows UK 
organizations to opt in to EU data standards. 


What if the United Kingdom doesn’t 
manage to join Horizon Europe before the 
end of 2020? 

This would be disruptive. “Both the UK and 
EU will lose out,” warns Thompson. Without 
association, UK researchers would be able to 
take part in Horizon Europe only as members 
of a ‘third country’: they could participate 

in some EU-funded projects but not lead 
them, and only if the government agreed 

to pay their costs. If there is no wider trade 
deal, it could affect travel, data sharing and 
laboratory supplies. UK researchers will 
almost certainly need visas to take up jobs 

in EU nations, whether or not the United 
Kingdom can craft a new immigration 
system in time. 


By Elizabeth Gibney 
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THE BATTLE TO 
EMBED ETHICS 
IN AIRESEARCH 


Bias and the prospect of societal harm increasingly 
plague the field of artificial intelligence. 


By Elizabeth Gibney 


iversity and inclusion took centre 
stage at one of the world’s major arti- 
ficial-intelligence (AI) conferences in 
2018. But at last month’s Neural Infor- 
mation Processing Systems (NeurIPS) 
conference in Vancouver, Canada, attention 
shifted to another big issue in the field: ethics. 
The focus comes as Al research increasingly 
deals with ethical controversies surrounding 
the application of its technologies — such as 
in predictive policing or facial recognition. 
Problems include tackling biases in algorithms 
that reflect existing patterns of discrimina- 
tionin data, and avoiding disproportionately 
affecting already-vulnerable populations. 
“There is no such thing as a neutral tech plat- 
form,” warned Celeste Kidd, a developmental 
psychologist at the University of California, 
Berkeley, during her NeurIPS keynote talk 
about how algorithms can influence human 
beliefs. At the meeting, which hosted a record 
13,000 attendees, researchers grappled with 
how to meaningfully address the ethical and 
societal implications of their work. 
Ethicists have long debated the impacts of 
Aland sought ways to use the technology for 


good, suchas in health care. But researchers 
are now realizing that they need to embed 
ethics in the formulation of their research 
and understand the potential harms of algo- 
rithmic injustice, says Meredith Whittaker, 
an Al researcher at New York University and 
founder of the AI Now Institute, which seeks 
to understand the social implications of the 
technology. At the latest NeurIPS, research- 
ers couldn't “write, talk or think” about these 
systems without considering possible social 
harms, she says. “The question is, will the 
change in the conversation result in the struc- 
tural change we need to actually ensure these 
systems don’t cause harm?” 


Meetings matter 


Conferences suchas NeurIPS, which, together 
with two other annual meetings, publishes the 
majority of papers in Al, bear some responsi- 
bility, says Whittaker. “The field has blown up 
so much there aren’t enough conferences or 
reviewers. But everybody wants their paper 
in. So there is huge leverage there,” she says. 
But research presented at NeurIPS doesn’t 
face a specific ethics check as part of the 
review process. The pitfalls of this were encap- 
sulated by the reaction to one paper presented 
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Facial-recognition algorithms have been at the centre of privacy and ethics debates. 
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at the conference, in which faces — including 
aspects of a person’s age, gender and ethnic- 
ity — were generated on the basis of voices. 
Machine-learning scientists criticized it on 
Twitter as transphobic and pseudoscientific. 

One solution could be to introduce eth- 
ical review at conferences. For the first 
time, NeurIPS 2019 included a reproduci- 
bility checklist for submitted papers. In the 
future, once accepted, papers could also be 
checked for responsibility, says Joelle Pineau, 
amachine-learning scientist at McGill Univer- 
sity in Montreal, Canada, and at Facebook, who 
is on the NeurIPS organizing committee and 
developed the checklist. 

NeurIPS says that an ethics committee is on 
hand to deal with concerns during the existing 
review process, but it is considering ways to 
make its work on ethical and societal impacts 
more robust. Proposals include asking authors 
to make a statement about the ethics of their 
work. The organizers of the annual Interna- 
tional Conference on Learning Representa- 
tions — another of the major Al meetings — said 
it was discussing the idea of reviewing papers 
with ethics in mind. 

Al Now goes a step further: in a report 
published last month, it called for all 
machine-learning research papers to include 
asection on societal harms, as well as the prov- 
enance of their data sets. Such considerations 
should centre onthe perspectives of vulnera- 
ble groups, which Al tends to affect dispropor- 
tionately, Abeba Birhane, a cognitive scientist 
at University College Dublin, told NeurIPS’s 
Black in Al workshop. Developers should ask 
not only how their algorithm might be used, 
but also whether it is necessary, she said. 


Business influences 


Tech companies — which are responsible for 
vast amounts of Al research — are also address- 
ing the ethics of their work. But activists say 
that they must not be allowed to get away 
with ‘ethics-washing’. Tech firms suffer from 
a lack of diversity, and although some com- 
panies have staff and entire boards dedicated 
to ethics, campaigners warn that these often 
have too little power. Their technical solutions 
—whichinclude efforts to ‘debias algorithms’ 
— are also often misguided, says Birhane. The 
approach wrongly suggests that bias-free data 
sets exist, and fixing algorithms doesn’t solve 
the root problems in underlying data, she says. 
Forcing tech companies to include people 
from affected groups on ethics boards would 
help, said Fabian Rogers, acommunity organ- 
izer from New York City. Rogers represents 
a tenants’ association that fought to stop its 
landlord from installing facial-recognition 
technology without residents’ consent. “Con- 
text is everything, and we need to keep thatin 
mind when we're talking about technology. It’s 
hard to do that when we don’t have necessary 
people to offer that perspective,” he said. 
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THE CLIMATE LINK TO 
AUSTRALIA S FIRES 


Researchers race to decipher how global warming has influenced 
the record bush fires that have ravaged much of Australia. By Nicky 
Phillips and Bianca Nogrady 
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nl January, the air in Canberra was 
the worst of any city in the world. 
With unprecedented bush fires 
raging nearby, a thick blanket of 
smoke smothered Australia’s capi- 
tal for weeks, sending a surge of resi- 
dents to the hospital with breathing 
problems. The toxic haze got so bad 
that Sophie Lewis, a climate scientist at the 
University of New South Wales (UNSW) Can- 
berra, took her toddler and boarded a plane 
to Tasmania. 

“| almost wept with relief in Melbourne, on 
the way to Hobart, simply from seeing the sky,” 
she says. After weeks in the smoke, her daugh- 
ter had grown used to all the people walking 
around with “bird beaks”, Lewis’s name for the 
masks everyone was wearing. 

From Hobart, Lewis fielded e-mails from 
concerned colleagues overseas. Like the rest 


Haze blankets Canberra on 5 January 
as visitors walk by Australia’s 
parliament building. 


of the world, they were stunned by the scale 
and severity of the fires ravaging Australia (see 
‘A country aflame’). Since September, more 
than 10 million hectares have burnt — an area 
greater than the size of Austria — and the fire 
season doesn’t end for several monthsin some 
states. So far, the conflagrations have killed at 
least 32 people and destroyed more than 2,000 
homes across 3 states. Through it all, people 
have been asking Lewis: did climate change 
have a role in these catastrophic fires? 

Lewis and a handful of her collaborators 
were busy discussing that very question. 
They work ina small but growing field called 
attribution science, which calculates the 
likelihood that an extreme event such as a 
heatwave, a flood or a catastrophic bush-fire 
season was made worse by climate change. In 
a study published last December', Lewis and 
her colleagues linked catastrophic 2018 fires in 
northeastern Australia to climate change, and 
they are now planning an attribution study for 
the fires that have gripped large parts of the 
country over the past few months. 

The work is being led by researchers in 
Europe who have conducted multiple rapid 
analyses of global warming’s role in extreme 
events. The team first has to grapple with how 
it will define the fire event for the purpose 
of its study: it is tricky to model the various 
weather conditions that increase fire risk, and 
the blazes haven't yet died out. But once that 
is decided, the team could produce results as 
early as February. 

Coming up with answers will be difficult. 
“Fire is probably the most complex physical 
and societal system known,” says Tim Brown, a 
climatologist at the Desert Research Institute 
in Reno, Nevada. “There’re so many different 
aspects of it, from the fuels and the people to 
the management practices.” 

But Australia and other countries need to 
know what they are facing. If attribution stud- 
ies can quantify the role of climate change in 
particular extreme events, scientists can bet- 
ter forecast the chances that the catastrophes 
will strike again. Such information is vital for 
emergency-response managers as they pre- 
pare for a warmer Earth. Firefighters in many 
countries have noticed, for instance, that big 
blazes are getting hotter and more dangerous, 
so modelling studies of future risks would help 
them train for and respond to the conflagra- 
tions to come. 


Burning lands 


Australia has always had fires — catastrophic 
ones, too. The really devastating ones earn 
their own name, such as Black Friday in 1939, 
Ash Wednesday in 1983 and Black Saturday in 
2009. The last of those killed 173 people: the 
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continent’s deadliest fire on record. All three 
—aswellas the current crisis — happened amid 
or at the end of long, intense droughts. 

This year’s unusually hot and dry conditions 
are driven in part by a natural meteorological 
phenomenon called the Indian Ocean Dipole 
(IOD), which is defined by differences in sea 
surface temperatures across the ocean. In 
its positive phase, warmer waters congre- 
gate near Africa, and rainfall is reduced over 
the southern and most northerly regions of 
Australia. This year saw one of the strongest 
positive swings in the IOD in recent history. 
Coupled with these events was a shift in the 
polar winds above Antarctica — also a natural 
phenomenon, but much rarer than a positive 
IOD. This sudden stratospheric warming, as 
it is known, contributed to bringing hot, dry 
weather to much of Australia. On top ofall this 
natural variation, global warming is making 
the country even hotter and drier, says Sarah 
Perkins-Kirkpatrick, a climate scientist at 
UNSW Sydney. 

Evidence has been growing for decades 
that climate change will exacerbate Austral- 
ia’s fire seasons. A prescient paragraphina 
2008 government-commissioned climate 
report that compiled evidence from the pre- 
vious 30 years warned that fire seasons would 
start earlier, end later and be more intense’. 
“This effect increases over time, but should be 
directly observable by 2020,” noted the report, 
authored by Ross Garnaut, an economist at the 
University of Melbourne. 

Lewis says we don’t need attribution studies 
to say that climate change is generally mak- 
ing fires in Australia worse. But as extreme 
events become more frequent — and the pace 


“Fire is probably the most 
complex physical and 
societal system known.” 


of warming shows no signs of falling — people 
want to know whether climate change had a 
hand ina specific extreme event. 

Lewis’s study on the 2018 event looked at 
130 bush fires that razed nearly 750,000 hec- 
tares over 5 days. On one climate model, the 
researchers ran thousands of simulations of 
future conditions, and they compared a world 
with current greenhouse-gas concentrations 
against one with pre-industrial levels. Those 
runs suggest that climate change had made 
the extreme temperatures — a major driver of 
fire weather — 4.5 times more likely. A second 
model showed that the below-average rainfall 
was also linked to increased greenhouse-gas 
concentrations, but only in some climate 
scenarios. The researchers say the study is 
one of many that connect climate change to 
increasing fire risks in eastern Australia. The 
work helps to confirm what many suspect 
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about the impacts of the major warming in 
Australia, says Perkins-Kirkpatrick, one of 
the authors of the report. Nine of Australia’s 
ten hottest years on record have occurred in 
the past 15 years. 


Cause and effect 


Friederike Otto, a climate modeller at the Uni- 
versity of Oxford, UK, started contemplating 
an attribution study on the Australian fires 
after she saw satellite images peppered with 
conflagrations and smoke plumes stretching 
across the continent. The event was too big 
to ignore, says Otto, who is a co-investigator 
at World Weather Attribution (WWA), a part- 
nership led by the university’s Environmental 
Change Institute and the Royal Netherlands 
Meteorological Institute that analyses the 
effects of climate change on extreme weather. 
WWA decided to doarapid attribution study, 
and invited Lewis, Perkins-Kirkpatrick and 
other researchers in Australia to join. 

The first step in any attribution study is to 
set out the limits of the event (see ‘A country 
aflame’), which is tricky in the Australian case 
because of the size of the area that has burnt 
and the time span over which it happened, 
says Perkins-Kirkpatrick. Once that has been 
done, the team will analyse whether tem- 
perature, rainfall and a ‘fire-weather’ index 
(FWI) — which includes those two variables 
and others — during the event were outside 
normal ranges. Last year was the country’s 
driest and hottest on record, and a heatwave 
that affected most of the country in December 
smashed the record for the hottest day ever 
recorded in Australia. The average maximum 
temperature across the country reached 41.9 
°C on18 December. 

To see whether climate change had arolein 
these extremes, the group will use halfa dozen 
climate models to run thousands of simula- 
tions, some reflecting current greenhouse-gas 
concentrations and others using pre-industrial 
levels. The group will also determine whether 
climate change made fire weather worse dur- 
ing the event. 

Perkins-Kirkpatrick is confident the study 
will pinpoint the influence of climate change 
on extreme temperatures, but its effects on 
dryness, humidity and winds are much harder 
to assess. That’s why it’s important to analyse 
the extent to which global warming influenced 
boththe FWland the individual components, 
says Otto. 

The team plans to publish its results in an 
open-review journal, as soonasthey’re ready, 
and probably in the next couple of weeks. “For 
an event like this, where a lot of people havea 
lot of opinions on the role of climate change, 
it is important to make the scientific process 
as transparent as possible,” says Otto. 

The study could also feed into future 
attribution work on fires, for which there 
has been a shortage of work. Hundreds of 
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attribution studies have shown that climate 
change increased the risks of specific heat- 
waves — including a record one in Europe last 
year. But only asmall fraction have looked at 
extreme fires, partly because fires are much 
more complex than heatwaves or droughts, 
says Brown. A report examining major firesin 
British Columbia in Canada in 2017 found that 
climate change made extreme fire weather 
two to four times more likely and increased 
the area of the province that burnt by at least 
a factor of seven®. Andacouple of studies have 
explored the factors driving a fivefold increase 
inthe area burnt in California since the 1970s*, 
and a twofold increase in burnt area in the 
western United States since the mid-1980s°. 
Both studies found that the particular trend 
was probably driven by increased drying of 
leaves, twigs, tree branches and other ‘fuels’ 
as aresult of global warming. 


Incendiary behaviour 


Most fire-attribution studies have focused 
on answering relatively straightforward 
questions, such as how much climate change 
contributed to, or exacerbated, the event. 
But Brown, whose team specializes in study- 
ing fire, wants to look deeper, and investigate 
how climate change is altering the behaviour 
of fires. In particular, he and his colleagues 
are looking at night-time warming, a factor 
he thinks might link global warming to bush- 
fire risk. When temperatures drop sharply at 
night, humidity tends to increase and that 
can help firefighters to suppress blazes. But 
when overnight temperatures remain high, 
fire managers have less success in combating 
fires, he says. Night-time temperatures have 
been climbing around much of the globe®, 
and Brown is exploring whether that change 
is raising the risk of fires. 
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Scientists are also interested in examining 
whether fires are getting more severe. The 
increased fuel aridity makes fires burn hot- 
ter, which increases the chances that a blaze 
will create its own weather system, sparking 
lightning and throwing embers kilometres 
ahead of the fire front’®. 

Smoke from these events can be so thick 
that it turns the sky an eerie red, or plunges 
everything into darkness. The haze travels 
for hundreds of kilometres, and can be seen 
from space. Lewis worries there isn’t enough 
attention on the health impacts for the mil- 
lions of Australians who’ve endured months of 
thick smoke. Beyond the damage to people’s 
lungs, the fires can take a psychological toll. 
When residents are stuck indoors for weeks, 
Lewis says, the smoke “makes you feel stressed 
and anxious and on edge. Everything smells 
of smoke.” 

Lewis and her family stayed in Tasmania for 
almost two weeks. Now back in Canberra, she’s 
seeing the effects this wild summer has had 
on her toddler, who has started asking where 
the red Sun went and what happened toallthe 
bird-beak masks. 


Nicky Phillips is Nature's Asia-Pacific bureau 
chief. Bianca Nogrady is a freelance writer in 
Australia’s Blue Mountains. 
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Isaac Asimov: centenary 
of the great explainer 


The chemist and science-fiction visionary championed 
rationality in 20 million published words. By David Leslie 


n 24 December 1968, astronaut 

William Anders took a photograph of 

Earth from the observation window 

of Apollo 8. The picture, now known 

as Earthrise, became one of the most 
iconic images inhuman history — later credited 
with catalysing the environmental movement. 
Is Earthrise a product of science or a work of 
art? Isaac Asimov, the Russian-born chemist 
and science-fiction colossus (1920-92), had 
an answer: the two are, in fact, the same. 

Asimov spent more than half of the 
twentieth century cultivating that transform- 
ative unity of art and science. He wrote and 
edited around 500 books and penned myriad 
stories, articles and essays. They spanned the 
rich microscopic worlds of cytoplasm, cells 
and subatomic particles, and ventured into the 
boundless wilds of space. Throughout, Asimov 
razed the make-believe boundary between 
imagination and reason. As he wrote in the 
gemlike 1978 essay ‘Art and Science’, the art- 
ist’s work suffers if knowledge is deficient; the 
scientist’s suffers if leaps of intuition, which so 
often outpace the leaden trot of rationality, are 
ignored. Advance in these arenas is often syn- 
ergistic, and scientists can “make great leaps 
into newrealms of knowledge by looking upon 
the universe with the eyes of artists”. 

For Asimov, nurturing ingenuity and insight 
through exploration, learning and communi- 
cation was an ethical imperative and crucial for 
human progress. It was an ethos he promoted 
through 20 million printed words. 


Life of wonder 


Asimov was central to science fiction’s Golden 
Age, as the writer of iconic works such as /, 
Robot (1950), the Foundation series and The 
Gods Themselves (1972). As a scientist, he was 
a popularizer who often drew comparisons 
with H. G. Wells. The leitmotif of his life was an 
unstinting thirst for knowledge. In Asimou’s 
New Guide to Science (first published as The 
Intelligent Man's Guide to Science in 1960), he 
wonderfully characterized the deep history of 
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life on Earth as an adventure of curiosity. The 
book takes us from the clumsy inquisitiveness 
of an upstart paramecium searching for food 
several hundred million years ago to the rest- 
less seeking that propelled big-brained Homo 
sapiens into the space age. That brain, the 
“most magnificently organized lump of matter 
inthe knownuniverse’”, generates the “curiosity 
surplus” shaping human culture, he wrote. 
Asimov’s curiosity was first sparked in the 
windowless back rooms of a string of Brooklyn 
sweet shops. Born in 1920 in Petrovichi in 
Soviet Russia, Asimov was three when he and 
his family arrived at Ellis Island and began to 
scrape a living in New York. Three years later, 
his father had saved enough money to set up 
the first shop. Young Isaac spent long days 
delivering newspapers, schlepping boxes 
and magazines — and reading voraciously. 
By the age of five, he had taught himself to 
read by studying street signs. At six, he got 
his first library card from the Brooklyn Public 
Library; he would eventually wangle a sec- 
ond one from the neighbouring borough of 


The Early 


Asimov 
volumet 


Queens, doubling his weekly intake. By his 
early teens, Asimov was horrifying neighbours 
as he wended his way blindly down busy streets 
with his nose in a book and one more under 
each arm. 

Asimov started reading science-fiction at 
nine, justas the genre had begunajourney from 
pulp extravagance toa more science-centred 
era. He had convinced his father that Hugo 
Gernsback’s magazine Science Wonder Stories 
contained serious stuff, despite the covers’ 
motley depictions of space ships and aliens. 
From then on, he would regularly abscond to 
the shop’s storeroomto immerse himself. Thus 
began a lifelong habit of exploring the open 
frontiers of possibility in enclosed, electric-lit 
spaces — not unlike the time-travelling Andrew 
Harlan (in his 1955 novel The End of Eternity), 
who zooms across thousands of centuries in 
ahumming “kettle”. 


The flag of reason unfurled 


Aprodigy, Asimov graduated from highschool 
at 15. He was, however, rejected by Columbia 
College in Manhattan and directed to Seth Low 
Junior College, a satellite school in Brooklyn. 
Anti-Semitism was at work. Undeterred, he 
plodded through his studies. Meanwhile, a 
distinctive vision of science fiction as “the liter- 
ature of social change” took shape in his mind. 
In 1938, he joined a remarkable sci-fi fan club, 
the Futurians, along with authors Frederik 
Pohl, Cyril Kornbluth and Donald Wollheim. 
The group was progressive and political, 
opposing the interwar rise of barbarism and 
militaristic ideologies. It called for science fic- 
tion to raise “the torch of science” and unfurl 
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Isaac Asimov published around 500 books, including science-fiction and popular science. 


© 2020 Springer Nature Limited. All rights reserved. 


SCOTSTOCK/ALAMY 


ARNOLD NEWMAN PROPERTIES/GETTY 


it i 


Ant 


Isaac Asimov in 1982 on Ellis Island, New York. 


“the flag of reason’ to foster amore united and 
humane society. 

Although he embraced this vision, Asimov 
felt that an impetus to change needed to be 
grounded in an understanding of its histori- 
cal complexities and enabling conditions. He 
became a student researcher for sociologist 
Bernhard J. Stern, aco-founder of the journal 
Science and Society who was completing a 
book on the social resistance to technolog- 
ical change — Society and Medical Progress, 
published in 1941. Asimov began to perceive 
technology as a social institution entangled 
with others, suchas politics and the economy. 

He realized that the processes of innovation, 
which make possible both science fiction and 
technological progress, are in struggle with 
the norms and institutions that constrain 
them. He saw that the modern “era of change” 
was the crucible for this struggle, as scientists 
and artists envisioned and created futures and 
realities radically different from the past. 

This conflict became Asimov's creative 
launchpad. Science fiction in his hands 
dove headlong into the eye of the societal 
storm, where future-facing human ingenuity 


“Science fiction in his 
hands dove headlong 
into the eye of the 
societal storm.” 


confronts the prevailing cultural patterns 
and historical conditions that determine the 
possibilities of the present. 


Tothe stars 


Asimov’s star beganits spectacular rise. While 
still an undergraduate, he published his first 
sci-fi piece, ‘Marooned Off Vesta’, in the mag- 
azine Amazing Stories. The freshly shaven 
18-year-old Asimov had also shown up, man- 
uscript in hand, at the door of the legendary 
editor of Astounding Science Fiction, John 
W. Campbell. Although Campbell gracefully 
turned down the story, he became Asimov's 
greatest literary mentor. Their relationship 
ended up tense and complicated, however, 
with Asimov explicitly rejecting Campbell’s 
fascist politics and belief in racial superior- 
ity. (It must be noted here that Asimov had his 


own egregious behaviour: the unapologetic 
harassing of women.) 

With Robert Heinlein and Arthur C. Clarke, 
among others, Asimov ushered in a turn 
towards technically sophisticated science 
fiction, inspiring readers to move beyond 
the escapist era of ray guns and romance to 
critically embrace the computer revolution 
and the coming space age. 

Asimov's annus mirabilis spanned 1941 and 
1942. He published ‘Nightfall’ in Astounding, 
a metaphor for humankind’s devastating 
psychological confrontation with its own 
cosmic insignificance. In 1968, the Science 
Fiction Writers of America voted it one of the 
greatest sci-fi stories ever penned. He also pro- 
duced several of his celebrated robot stories 
— suchas ‘Reason, ‘Liar’ and ‘Runaround’—in 
which he coined the term robotics (after the 
robots of R.U.R (Rossum’s Universal Robots), a 
play by Czech writer Karel Capek, published 
100 years ago) and famously invented the 
field’s three ethical laws. Those accomplish- 
ments are honoured every April inthe United 
States, during National Robotics Week. 

And 1942 saw the publication of ‘Foundation’, 
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the inaugural instalment of Asimov’s epon- 
ymous series and a masterwork of future 
history modelled on Edward Gibbon’s eight- 
eenth-century classic, The History of the Decline 
and Fall of the Roman Empire. \n the series, 
humanity battles against a galactic descent 
into 30,000 years of barbarism, foretold by 
the statistical science of ‘psychohistory’. The 
epicis rife with the struggles of human agency 
against the “dead hand of social inevitability” 
— themes still unnervingly relevant in today’s 
algorithm-driven ‘big data’ society. In 1966, the 
Foundation trilogy won a Hugo Award for the 
best science-fiction series ever written. 


Responsibility of communication 


The next decade was a busy one. Asimov 
worked as a scientist for the US Navy during 
the Second World War; completed a doctor- 
ate in chemistry at Columbia; and, in 1949, 
took a post teaching biochemistry at Boston 
University School of Medicine in Massa- 
chusetts. Even as his science-fiction career 
gathered pace, his academic role nudged 
him towards science writing, beginning 
with the co-authored 800-page Biochem- 
istry and Human Metabolism in 1952. Six 
books on chemistry followed, along with a 
run of sparkling popular-science essays with 
beguiling titles such as ‘The Explosions Within 
Us’ and ‘The Abnormality of Being Normal’. 

On4 October 1957, the world was galvanized 
as the Soviet Union propelled the 84-kilogram 
satellite Sputnik 1 into elliptical Earth orbit. 
In the United States, panic rose over the gap 
in research progress. As US citizens turned 
on short-wave radios to listen for its eerie 
‘beep-beep’, Sputnik mania sparked a massive 
demand for educational popular-science writ- 
ing — anda shift towards a more unified vision 
of Earth. Nature ended and Ecology was born, 
the media theorist Marshall McLuhan later 
noted: “The planet became a global theater in 
which there are no spectators but only actors. 
On Spaceship Earth there are no passengers; 
everybody is amember of the crew.” 

Asimov plunged into both educating the 
public and advocating a sustainable future. 
In 1958, he switched to mainly writing science 
books. The following year, he wrote halfa mil- 
lion words in eight months, culminating in The 
Intelligent Man’s Guide to Science, nominated 
for the 1960 National Book Award. 

Across his non-fiction, Asimov dedicated 
himself to initiating lay readers into the world 
of scientific exploration through unadorned 
but wonder-filled writing. The aim was to allow 
them to weigh up the prospects and challenges 
of scientific advance lucidly and capably. 
Astronomer and science communicator Carl 
Sagan’s obituary of Asimovin Nature calls him 
“one of the great explainers of the age”, moti- 
vated by a “profoundly democratic impulse” 
(C. Sagan Nature 357, 113; 1992). 

Asimov himself dubbed that impulse the 


616 | Nature | Vol577 | 30 January 2020 


Will Smith in Alex Proyas’s film I, Robot (2004), based on Asimov's fiction. 


“responsibility of communication”. Ina1988 
interview, he explained that it rested on the 
fact “that science is tentative, that it is not cer- 
tain, that it is subject to change”. Therefore, no 
voice can have the last word, and open, inclu- 
sive and rational communication is the only 
option for advancement. The public, mean- 
while, bears the “responsibility of listening”. 
These strains of Enlightenment thinking 
made hima staunch critic of pseudoscience, 


“Asimov wrote: ‘Science 
gathers knowledge faster 
than society gathers 
wisdom.” 


intolerance and superstition. He spent a good 
part of his final years championing secular 
reason in an age when, he believed, the candle 
of critical thinking had dimmed. In 1988, he 
observed: “the saddest aspect of life right now 
is that science gathers knowledge faster than 
society gathers wisdom”. 


The good Earth is dying 


Asimov recognized that this troubling 
disconnect had grave planetary consequences. 
Inhis perceptive 1971 essay ‘The Good Earth Is 
Dying’, he pointed out that, with accelerating 
technological advancement, harmful impacts 
on society and the environment had already 
reached a global scale. Meanwhile, attitudes 
suchas the reverence for limitless growth and 
the embrace of tribalism persisted — dictated 
by antiquated parochial beliefs and local con- 
ventions, rather than being transposed ina 
global key and altered accordingly. Asimov 
wrote: “What was common sense in a world 
that once existed has become myth in the 
totally different world that now exists, and 
suicidal myth at that”. 
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Many decades before the rise of idioms of 
anxious sociotechnical reflection suchas ‘the 
sixth extinction’, or the ‘Anthropocene epoch’, 
Asimov was writing about how “anthropo- 
genic processes” were poisoning the planet. 
From the 1970s onwards, he wrote about how 
atmospheric pollution was destroying the 
ozone layer, acidifying the oceans and exac- 
erbating the scale of natural catastrophes. He 
described human-driven global warming and 
its probable effects on the biosphere. He noted 
how overpopulation and the sweep of reck- 
less human activities were prompting a “great 
die-off” of species at an unprecedented rate. 

Asimov called for global solutions to global 
problems, anchored in a vision of humanity 
united by acommon aim. That stance drove 
him from the start. More thana decade before 
Earthrise so stirringly revealed our planetary 
home, Asimov, inthe 1953 essay ‘Social Science 
Fiction’, reflected on howliterature portrayinga 
multitude of worlds puts ours into perspective. 
Inaconceptually vast Universe, Earth shrinks, 
to beneficial effect: subdivisions that ordinarily 
polarize can become much harder to perceive 
or conceive, and humanity can be seenas faced 
with “common dangers and common tasks”. 

A century after Asimov’s birth, forests 
burn from Australia to California. Shorelines 
are swallowed by rising seas, towns ravaged 
by unearthly storms. Humanity’s insatiable 
appetites continue to crush the diversity of 
life, and conflicts draw us ever closer toa fiery 
end. At such ajuncture, we might do well to 
pick up Asimov's writings and take flight with 
him. Perhaps then we can together peer back 
at our pale blue island, suspended in the void, 
and gain a saner, more humane and more 
rational point of view. 


David Leslie is the Ethics Fellow at the Alan 
Turing Institute in London. 
e-mail: dleslie@turing.ac.uk 
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Correction 

Isaac Asimov: centenary of the great 
explainer 

This article should have noted that Asimov had 
his own egregious behaviour: his unapologetic 
harassing of women. 

See https://doi.org/10.1038/d41586-020- 
00176-4 
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Swift visas for 
post-Brexit science 


As the UK government’s chief 
scientific adviser, |welcome 
the announcement of anew 
fast-track immigration scheme 
for researchers to help ensure 
that the United Kingdom 
remains atop destination for 
scientific talent after it leaves 
the European Union later this 
year. The scheme — which is 
being incorporated within a 
reformed and rebranded Global 
Talent Route — will go live on 
20 February. 

The fast-track scheme 
applies to all eligible overseas 
researchers and their team 
members who receive 
peer-reviewed grants from 
recognized funding bodies. 
The national funding agency, 
UK Research and Innovation, 
will oversee the eligibility of 
funding bodies and establish 
anewcriterion for automatic 
endorsement. Dependants will 
continue to have full access to 
the labour market. There will 
also be an accelerated path to 
settlement. There is no cap on 
the number of researchers who 
can benefit. 

The scheme will allow 
UK-based researchers to recruit 
overseas talent to their teams. 
Attracting the best international 
scientists at all career stages 
is animportant part of the 
government's strategy to boost 
research and development. This 
first phase of changes goes a 
long way towards ensuring that 
the United Kingdom remains 
a global leader in science 
excellence. 


Patrick Vallance Government 
Office for Science, London, UK. 
contact@go-science.gov.uk 


Crop revolutions 
must reach poor 


As researchers who recognize 
that plant science underpins 
food security, we applaud 

the development of genetic 
strategies that could sustainably 
improve crop yields (J. Bailey- 
Serres et al. Nature 575, 109-118; 
2019). However, the socio- 
economic implications of such 
technologies could prevent 
farmers and consumers from 
adopting them, particularly in 
developing countries hit hardest 
by climate change. 

Commercial interests 
typically drive the 
implementation of crop 
technologies. Advances in 
crop science are more about 
integrating technology with 
global economic realities, 
which can include poverty, poor 
governance, lack of market 
access and inefficient supply 
chains (A. A. Adenle et al. Nature 
Biotechnol. 36, 137-139; 2018). 

Taking such limitations 
into account, along with 
public unease about genetic 
modification and the use of 
the gene-editing tool CRISPR 
in food production (J. L. Lusk 
et al. Food Pol. 78, 81-90; 2018), 
we consider that a broader 
range of technologies should be 
explored in parallel. Innovative 
approaches to plant breeding 
strategies, for example, could 
deliver a new green revolution. 


Christine H. Foyer* University 
of Birmingham, UK. 
c.h.foyer@bham.ac.uk 

*On behalf of 5 correspondents; 
see go.nature.com/38hcijjv. 
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Research funding 
gloom for Catalexit 


I disagree with Joan Martinez 
Alier’s view that Catalonia’s 
research funding could increase 
if the region were to become 
independent of Spain (Nature 
576, 384; 2019). For astart, 
Catalonia would lose out on 
future European research 
grants because it would cease 
to bea member of the European 
Union. And, contrary to Alier’s 
suggestion, discontinuing 
fiscal transfers to Spain would 
make little difference in its 
landscape of bulging public 
debt, departing businesses, and 
no access to European Central 
Bank financing. 

Madrid has acomparable 
weight in Spain’s economy. 
Although fiscal transfers from 
Madrid to the rest of Spain are 
much higher than those from 
Catalonia, its absolute and 
per capita public debt are less 
than half those of Catalonia. 
Since 2012, Catalonia has 
drawn more than €70 billion 
(US$78 billion) in favourable- 
term loans from Spanish public 
sources such as the Regional 
Liquidity Fund. To make 
matters worse, an independent 
Catalonia might need to add its 
share of Spanish public debt — 
around another €200 billion 
— toits regional debt (see 
go.nature.com/37kqclc). 


José M. Rojo Margarita Salas 
Biological Research Center, 
CSIC Madrid, Spain. 
jmrojo@cib.csic.es 
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Genentech not first 
biotech company 


In his illuminating history of 
corporate research, Paul Lucier 
repeats the common mistake 
of calling Genentech the first 
biotechnology firm (Nature 
574, 481-485; 2019). Cetus was 
founded five years earlier, in 
1971, by Nobel-prizewinner 
Donald Glaser and others. It 
initially developed microbial 
processes for producing 
chemical feedstocks, including 
propylene oxide and antibiotic 
intermediates. The corporation 
later pivoted to therapeutics. 
Genentech was backed 
by venture capital. Cetus 
was funded largely by other 
means, including support from 
Standard Oil. Consequently, 
Cetus and a few other early 
biotech companies — Irvine 
Scientific, Gamma Biologicals 
and Cell Associates among them 
— have been overlooked or long 
forgotten because of a history 
that equates biotech with 
venture-capital-funded drug 
discovery. 


William Bains Five Alarm Bio, 
Cambridge, UK. 
william@fivealarmbio.com 
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Stem cells 


Fight or flight 


turns hair white 


Shayla A. Clark & Christopher D. Deppmann 


Signalling from the sympathetic nervous system of mice 
when subjected to stress leads to the depletion of a stem-cell 
population in their hair follicles. This discovery sheds light on 
why stress turns hair prematurely grey. See p.676 


It has been said that Marie Antoinette’s hair 
went completely white on the night before her 
beheading. This story might be apocryphal, 
but rapid greying of the hair is now widely 
referred to as Marie Antoinette syndrome. 
It is often assumed to be caused by stress — 
a phenomenon perhaps best exemplified by 
photographs of heads of state before and 
after they held office. However, the relative 
contributions of ageing, genetic factors and 
stress to greying are not known — in part owing 
to alack of mechanistic understanding of the 
process. On page 676, Zhang et al.'identify the 
mechanism governing premature greying in 
mice that have experienced stress. 

The average human scalp has 100,000 hair 
follicles, and a wide range of hair colours can 
be found across the human population. Hair 
colour is determined by cells called melano- 
cytes, which produce different combinations 
of light-absorbing melanin pigments”. Melano- 
cytes are derived from melanocyte stem cells 
(MeSCs), which are located ina part of the hair 
follicle called the bulge®. The normal hair cycle 
is divided into three stages: hair-follicle regen- 
eration (anagen), degeneration (catagen) and 
rest (telogen). Melanocyte production begins 
early in the anagen phase (Fig. 1a). As people 
age, the pool of MeSCs is gradually depleted 
— and so pigmented hair becomes ‘salt and 
pepper’ coloured, and then turns to grey 
and finally to white after a complete loss of 
pigment in all hair follicles’. 

Aside from ageing, there are several factors 
that bring about premature greying, includ- 
ing dietary deficiencies’, disorders such as 
alopecia areata or vitiligo®’, and stress®’. 
Zhang et al. set out to test the role of stress 
in the greying process in mice. They exposed 
the animals to three different stressors — 
pain, restraint and a model of psychological 


stress — during different phases of hair growth. 
Eachstressor caused depletion of MeSCs from 
the bulge region, eventually leading to the 
development of patches of white hair. 
Prevailing theories posit that stress-induced 
greying involves hormones (such as cortico- 
sterone) or autoimmune reactions”. Zhang and 
colleagues examined these potential mech- 
anisms, first by preventing corticosterone 


a Normal 


> Anagen ——, New anagen 


Melanocyte 


Catagen, 
telogen 


Sympathetic neuron 


signalling and next by stressing animals 
that had compromised immune systems. In 
both cases, greying occurred after stress, 
indicating that neither corticosterone nor 
autoimmune reactions cause MeSC deple- 
tion. However, the authors found that MeSCs 
express B,-adrenergic receptors, which 
respond to noradrenaline — a neurotransmit- 
ter molecule involved in the ‘fight or flight’ 
response to stress. Loss of this receptor 
specifically in MeSCs completely blocked 
stress-induced greying. 

Adrenal glands are the main source of 
circulating noradrenaline. But, surprisingly, 
the researchers discovered that removing 
these glands did not prevent greying in 
response to stress in the mice. 

Another source of noradrenaline is the 
sympathetic nervous system (SNS), which is 
highly active in response to stress, and which 
drives the fight-or-flight response. Zhang and 
colleagues showed that bulge regions are 
highly innervated by sympathetic neurons, 
and that ablating the SNS using a neurotoxin 
molecule, or blocking the release of noradren- 
aline from sympathetic neurons, prevented 
stress-induced greying. Next, the authors 
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Figure 1|Melanocyte stem cells and stress. Melanocyte stem cells (MeSCs) are located in the bulge 

of the hair follicle, which is innervated by neurons of the sympathetic nervous system that release the 
neurotransmitter molecule noradrenaline. The follicle cycles through three phases: regeneration (anagen), 
degeneration (catagen) and rest (telogen). a, Under normal conditions, MeSCs migrate away from the bulge 
(red arrows) and differentiate into melanocytes during anagen. Melanocytes synthesize pigments that add 
colour to the regenerating hair. During catagen and telogen, they begin to die and migrate out of the niche 
(not shown). However, plentiful MeSCs remain to replace the melanocytes in the next anagen phase. b, Zhang 
etal.’ show that stressful stimuli activate the sympathetic nervous system, increasing noradrenaline release 
in hair follicles. Noradrenaline causes complete conversion of MeSCs into melanocytes, which migrate out 

of the niche in catagen and telogen. The hair follicle is depleted of MeSCs that would have differentiated to 
replace these melanocytes. Without any pigment cells to colour the hair in the next anagen phase, it begins to 


look grey or white. 
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generated mice in which sympathetic neurons 
could be acutely activated, and found that 
overactivation of the SNS in these mice caused 
greying in the absence of stress. Together, 
these results indicate that noradrenaline 
released from active sympathetic neurons 
triggers MeSC depletion (Fig. 1b). Interest- 
ingly, Zhang et al. found that the propensity 
of an area to turn grey correlates with its 
level of sympathetic innervation. 

Exactly how does sympathetic activity 
cause depletion of MeSCs from hair follicles? 
Normally, these stem cells are maintainedina 
dormantstate until hair regrowthis required. 
However, when the researchers tracked MeSCs 
labelled with a fluorescent protein, they 
discovered that MeSC proliferation and dif- 
ferentiation increase markedly under extreme 
stress or exposure to a high level of nor- 
adrenaline. This results in mass migration of 
melanocytes away from the bulge, and leaves 
no remaining stem cells. To further confirm 
this result, the researchers suppressed MeSC 
proliferation pharmacologically and genet- 
ically. When proliferation was dampened, 
the effects of stress on MeSC proliferation, 
differentiation and migration were blocked. 

Zhang and colleagues’ work raises several 
questions. For instance, is the mechanism 
underlying MeSC depletion in response to 
stress the same as that which causes greying 
during ageing? Future experiments modulat- 
ing SNS activity over a longer period would 
determine whether age-related greying can be 
slowed or hastened. Perhaps, inthe absence of 
sympathetic signals, MeSCs have the capacity 
for unlimited replenishment, pointing to a way 
to delay age-related greying. 

Are other pools of stem cells similarly 
susceptible to stem-cell depletion in response 
to stress, ifthey or the cells that make up their 
niche express B,-adrenergic receptors? In 
support of this idea, haematopoietic stem 
and progenitor cells (HSPCs), which give 
rise to blood and immune lineages, reside in 
a bone-marrow niche that contains stromal 
cells, and stimulation of those cells by the SNS 
causes HSPCs to leave their niche”. Perhaps, 
like MeSCs, stress depletes HSPCs — which 
could partially explain why immune function 
is impaired in response to chronic stress®™, 
Whether this type of relationship extends 
beyond MeSCs and HSPCs is an open question. 

It is fascinating to consider what possible 
evolutionary advantage might be conferred 
by stress-induced greying. Because grey hair 
is most often linked to age, it could be associ- 
ated with experience, leadership and trust”. 
For example, adult male silverback mountain 
gorillas (Gorilla beringei beringei), which get 
grey hair on their backs after reaching full 
maturity, can go on to lead a gorilla troop”. 
Perhaps an animal that has endured enough 
stress to ‘earn’ grey hair has a higher place 
in the social order than would ordinarily 
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be conferred by that individual's age. 

Connecting the dots between stress, fight 
or flight, stem-cell depletion and premature 
greying opens up several avenues for future 
research. Beyond developing anti-greying 
therapies, Zhang and colleagues’ work 
promises to usher in a better understanding 
of howstress influences other stem-cell pools 
and their niches. 
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Suspect that modulates 
the heartbeat is ensnared 


Xiaohan Wang & Richard W. Tsien 


The activity of calcium channels in the heart increases during 
what is called the fight-or-flight response. An investigation 
into the 50-year-old mystery of how this occurs has captured a 
previously overlooked suspect. See p.695 


In the Sherlock Holmes tale The Adventure 
of the Dancing Men, the detective runs a 
heart-pounding race to try to save his client’s 
life. The thumping of the sleuth’s heart — a 
literary example of the ‘fight-or-flight’ effect! 
— reflects the changes that occur when the 
entry of calcium ions into the heart rises”. 
On page 695, Liu et al. provide a solution to 
the long-standing riddle of how this occurs, 
through deductions worthy of Sherlock 
Holmes. 

Someaspects of howcalcium enters the heart 
during a fight-or-flight response are known. 
The process is mediated by the hormone adren- 
aline acting on B-adrenergic receptors — pro- 
teins that reside in the surface membrane of 
heart cells called cardiomyocytes. Receptor 
activation leads to an increase in the opening of 
whatis called an L-type voltage-gated calcium 
channel. This occurs through a mechanism that 
involves the molecule cyclic AMP (cAMP)** and 
an enzyme called protein kinase A (PKA) that 
requires cAMP for its function®. Similar types 
of PKA-mediated processes are found in other 
contexts. For example, some neurons use cAMP 
and PKA to enhance calcium entry through 
L-type calcium channels’. 


© 2020 Springer Nature Limited. All rights reserved. 


Exactly how the stimulation of B-adrenergic 
receptors modulates calcium-ion influx has 
been debated since the 1970s. Researchers 
have uncovered tantalizing clues to the identity 
of the target molecule that PKA modifies by 
the addition of a phosphate group (phospho- 
rylation). However, proposals for specific 
candidate targets, phosphorylation sites® ° 
and modulatory mechanisms have been 
repeatedly called into question by further tests, 
including some in painstakingly constructed 
mouse models”. Liu et-al. use a powerful 
technique called proximity proteomics" to 
implicate a previously under-appreciated 
suspect” and to establish its role. 

L-type voltage-gated channels provide 
a route by which calcium ions enter cardio- 
myocytes to help trigger a heartbeat. If 
channel opening is boosted, this results ina 
stronger and faster heartbeat. Previous inves- 
tigations of how PKA might modulate channel 
opening focused mainly on amino-acid res- 
idues in the channel that occur in structural 
motifs possibly phosphorylated by PKA. But 
when Liu and colleagues performed a tour- 
de-force experiment in mice in which the 
channel was engineered so that all candidate 


phosphorylation sites were converted to an 
amino acid (alanine) that can’t be phospho- 
rylated, and when this channel was studied 
alone, PKA-mediated enhancement of L-type 
channels nevertheless persisted. The authors 
therefore looked elsewhere for the elusive 
mediator of PKA’s ability to regulate the 
fight-or-flight effect. 

Reasoning that some unknown factor must 
come into close proximity to the calcium 
channel during this regulatory process, the 
authors conducted a systematic search. Using 
proximity proteomics, Liu and colleagues 
engineered channel subunits to contain an 
enzyme that adds a tag called biotin to any 
protein within a radius of approximately 
20 nanometres”. Tagged proteins were then 
identified by mass spectrometry. Hundreds 
of proteins in proximity to the calcium chan- 
nel were analysed, and the authors found that 
the protein Rad was enriched in the channel 
microenvironment under resting condi- 
tions, but was noticeably depleted during 
stimulation of the B-adrenergic receptor. 
This dovetailed with an earlier clue — Rad is 
known to inhibit L-type voltage-gated cal- 
cium channels”, and, in mice, deletion of 
the gene that encodes Rad mimics the effect 
of B-adrenergic stimulation and eliminates 
further adrenaline-mediated enhancement 
of the activity of L-type channels”. 

Liu et al. investigated whether PKA could 
prevent Rad-mediated channel inhibition. 
The authors tested whether phosphoryla- 
tion of amino-acid residues on Rad would 
enable it to move away from the vicinity of the 
calcium channel. They narrowed the candidate 
residues downto four serines (in some experi- 
ments, just two), which, if replaced by alanine, 
abolished PKA-mediated regulation of 
calcium entry. 

The calcium channel’s B-subunit was the 
prime suspect as the target of Rad inhibi- 
tion. Ablation of the interaction between 
the calcium channel’s a,,-subunits and its 
B-subunits fully eliminates PKA-mediated 
modulation of channel activity”. Indeed, the 
authors’ measurements, using a technique 
called fluorescence resonance energy transfer, 
showed that the interaction between Rad and 
the calcium-channel B-subunit was inhibited 
by PKA phosphorylation of the key serines in 
Rad that the authors had identified. Further 
tightening the noose around Rad’s metaphor- 
ical neck, electrical recordings demonstrated 
that all of the biophysical fingerprints of mod- 
ulation by B-adrenergic signalling — such as 
the activity of previously inactive calcium 
channels and a shift in the voltage depend- 
ence of their activation’ — were prevented 
by eliminating Rad phosphorylation. 

The results make a compelling case for the 
following scenario (Fig. 1). Adrenaline binds 
and activates the B-adrenergic receptor. This, 
inturn, results in the activation of an enzyme 


a Lowactivity of the 
calcium channel es @ 


B-adrenergic 
receptor 


Membrane 


Cytoplasm of 
cardiomyocyte 


Inactive PKA 
jee 


( 


b High activity of the 
calcium channel 


Adrenaline 


@ = 
cAMP 
Activated PKA Me 


Figure 1| Modulation of the cardiac calcium channel. In heart cells called cardiomyocytes, the activity 

of calcium-ion channels increases during what is called the fight-or-flight response. Activation of the 
enzyme protein kinase A (PKA) is required for this process, and, in mouse studies, Liu et al.> reveal that its 
elusive target is the protein Rad. a, In the absence of a fight-or-flight response, the B-adrenergic receptor is 
not stimulated and PKA is inactive. Rad binds to a subunit of the calcium channel (beige; only the a,-- and 
B-channel subunits are shown) and calcium-ion (Ca?*) entry into cardiomyocytes is low. b, During the fight- 
or-flight response, the hormone adrenaline activates the B-adrenergic receptor. This leads to the production 
of cyclic AMP (cAMP) molecules, which activate PKA. Activated PKA adds a phosphate group (P) to Rad, 
causing Rad to dissociate from the channel and enabling channel activity to increase. This elevation of Ca?* 


in the cytoplasm boosts the heartbeat. 


that produces cAMP, which activates PKA. 
PKA phosphorylates Rad and causes it to leave 
the vicinity of the calcium channel, thereby 
preventing it from inhibiting the channel. 
The study puts Rad and other members of 
this family of proteins front and centre as play- 
ers incalcium-channel modulation. Is Rad the 
entire missing chapter in the story of PKA’s role 
in the heart, given Liu and colleagues’ com- 
pelling arguments that other potential PKA 
targets are unnecessary? Sceptics will want 


“Thestudy puts Rad and 
other members of this family 
of proteins front and centre 
as players in calcium-channel 
modulation.” 


further in vivo evidence froma type of mouse 
model termed a knock-in — animals whose 
original Rad sequence is replaced either with 
a version in which Rad’s own PKA-phospho- 
rylation sites are mutated or with a version 
in which the part of Rad needed for the inter- 
action with the B-subunit is eliminated — to 
see whether any PKA-mediated modulation 
of the calcium channel still occurs. Hints of 
differences between channel regulation in 
the embryonic and adult heart” also warrant 
further study. 

Might cardiac regulation by Rad be of 
clinical value? Heart failure in humans is 
associated with loss of regulation of calcium 
channels by B-adrenergic receptors. Rad 
levels fall during heart failure’, perhaps pro- 
viding atemporary increase in the strength of 
heart contraction’. However, this would also 
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reduce the heart’s ability to further increase 
its strength”, what is known as its functional 
reserve, which would be a severe price fora 
person’s heart to pay. 

There will undoubtedly be debate about 
how PKA modulation of calcium channels 
operates in neurons, suchas in PKA-responsive 
CA1 pyramidal cells in which Rad is essentially 
absent. In those neurons, the mutation of a 
particular serine (serine 1928) to alanine inthe 
L-type channel eliminates channel modulation 
and L-type channel-dependent strengthen- 
ing of inter-neuronal (synaptic) connections’. 
Here, PKA might be phosphorylating the 
calcium channel, after all. 

Organ-specific pathways for regulation 
would make functional sense. Rad can com- 
pletely inhibit calcium-channel activity, and 
so modifying such inhibition would give 
heart cells a wide range of regulatory capa- 
bility'’, suitable for a brief flight-or-fight 
response. Perhaps other cell types needing 
a more sustained but subtler boost to their 
calcium-channel activity might operate better 
without Rad-mediated regulation and rely 
instead on milder, more direct modulation 
of asubunit of the calcium channel. 

Liu and colleagues have set a high bar for 
future detective work on cellular signalling in 
the heart. Their work shows the power of asys- 
tematic round-up of suspects and relentless 
interrogation of their roles. 
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Amilestone inthe hunt 
for metallichydrogen 


Serge Desgreniers 


An optical study of cold solid hydrogen at extreme pressures 
indicates that electrons in the material are free to move like 
those ina metal. This suggests that the long-sought metallic 
phase of hydrogen might have been realized. See p.631 


Hydrogen isthe most abundant elementin the 
Universe. Its molecular-gas state is simple, but 
its solid state has proved to be complex. In1935, 
it was predicted that solid hydrogen should 
behave like an electrical conductor at elevated 
pressures, owing to its molecules being sep- 
arated into their atomic constituents". This 
prediction heralded a race to prove experimen- 
tally that solid hydrogen displays such metallic 
behaviour under ultrahigh compression. How- 
ever, although there have been many claims 
of proof (for example, refs 2-4), these stud- 
ies have been challenged. Now, on page 631, 
Loubeyre et al. report that dense hydrogen 
shows a discontinuous and reversible change in 
optical reflectivity at extreme pressure and low 
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Figure 1 | Effect of increasing pressure on cold solid hydrogen. a, Loubeyre 
etal.’ have studied solid hydrogen at extreme pressure and low temperature 
using a device known asa diamond anvil cell. This device compresses a sample 
of the material, which is confined to a microscopic chamber ina thin metal foil, 
between two diamond anvils. At first when the pressure is applied, the sample 
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temperature that can be attributed to a phase 
transition into a metallic state. 

Itiscommon practice to use a device called 
adiamond anvil cell to achieve ultrahigh com- 
pression of a material and to study changes in 
the material’s physical properties at high den- 
sity. A diamond anvil cell squeezes a sample, 
whichis confined toa microscopic chamber in 
athin metal foil, between two diamond anvils 
(Fig. 1a). The device operates ona deceptively 
simple physical concept: pressure is inversely 
proportional to the area of a surface over 
which a force is applied. In the present case, 
this simplicity comes with an inherent draw- 
back: reaching extreme pressures inevitably 
implies working with tiny sample volumes. 


b About 300 GPa 
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Conventional techniques have been the 
bottleneck in applying extreme pressures to 
highly compressible materials such as hydro- 
gen. Over the past few decades, research 
groups around the world have pushed the 
boundaries of pressure generation. They have 
also refined the tools and methods needed 
to accurately estimate pressures applied to a 
microscopic sample of compressed gas. Never- 
theless, debate continues over the accuracy of 
reported pressures and the interpretation of 
results drawn from measurements of physical 
properties. 

Recognizing this long-standing problem, 
Loubeyre and colleagues’ research group 
developed an innovative approach that 
involves the precise sculpting of diamond-anvil 
surfaces using a stream of massive ions® — a 
technique called focused ion-beam milling. 
Asimilar experimental development has also 
been reported’. The profiled anvils produce 
extreme pressures that can be reliably esti- 
mated, reaching more than 400 gigapascals 
(about 4 million times Earth’s atmospheric 
pressure). Moreover, the shape of the anvils 
helps to confine dense hydrogen samples that 
are suitable for optical measurements. 

Under increasingly extreme pressures, 
dense hydrogen becomes more and more 
opaque to visible light. For pressures in excess 
of about 300 GPa, solid hydrogen becomes 
penetrable only by electromagnetic radiation 
of lower energy than visible light? **, such as 
infrared radiation (Fig. 1b). Loubeyre et al. 
measured the optical transparency of solid 
hydrogen at pressures much higher than 
those reached previously, using the near- 
to-mid-infrared emission from a source of 
synchrotron radiation — electromagnetic radi- 
ation that is produced when charged particles 
are accelerated ina curved path. 

The authors found that a compressed 
sample of hydrogen blocks all light and 
exhibits an abrupt increase in optical reflec- 
tivity when the pressure is raised above 
425 GPa (Fig. Ic). Moreover, they discovered 
that this transition is reversible. The authors 


© Above 425 GPa 


Vy 


is transparent to both infrared and visible light (GPa, gigapascals). b, When the 
pressure is raised to roughly 300 GPa, the dense hydrogen loses its transparency 
to visible light. c, Finally, when the pressure is above 425 GPa, the sample 
becomes reflective to both infrared and visible light, indicating a shift into the 
long-sought metallic state of hydrogen. 


attribute the change in optical reflectivity to 
a pressure-induced phase transition in which 
electrons in the sample become free to move 
like those ina metal. Hydrogen remains as a 
molecular solid up to the transition pressure; 
it possibly stays in this state above 425 GPa, but 
itis difficult to confirm this by spectroscopy 
because there is a reduced coupling between 
light and matter in these extreme conditions. 

It can certainly be argued that a definite 
proof for metallic hydrogen would come only 
from ameasurement of the sample’s electrical 
conductivity at high pressure as a function 
of temperature. Solid hydrogen should 
exhibit a high level of electrical conduction 
that should then decrease as the sample 
temperature is raised. However, even with 
experimental techniques developed in the 
past few decades to study condensed matter 
in extreme conditions, electrical-transport 
measurements of hydrogen remain a huge 
challenge”””. 

Nevertheless, Loubeyre and co-workers’ 
findings should be considered as a close-to- 
definite proof of dense hydrogen reaching a 
metallic state in extreme-pressure conditions. 
Computational predictions of the pressure at 
which molecular hydrogen enters a metallic 
state still lack accuracy, because they require 
many different quaantum-mechanical correc- 
tions that are difficult to address. However, 
the experimental value of 425 GPa agrees 
with calculations" that predict a transition in 
hydrogen toa different solid phase at a similar 
pressure. 

Loubeyre and colleagues’ study has 
combined innovative techniques for ultra- 
high-pressure generation with advanced 
experimental methods using synchrotron 
radiation. In doing so, it has raised expecta- 
tions for the discovery of other remarkable 
properties of solid hydrogen at extreme 
density. For the time being, many questions 
remain. For instance, could electrical resistiv- 
ity be measured across the metallic transition? 
Could superconductivity at a record-high 
temperature be achieved in hydrogen? And 
could the molecular order be disrupted under 
ultrahigh pressure and lead to an atomic phase 
inthe solid state? 

Competition is still strong between different 
research groups seeking to answer these 
questions, and to further unveil and under- 
stand the characteristics of hydrogen at 
extreme density. More exciting findings are 
sure to come at every stage of the race. 
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Protein-structure 
prediction gets real 


Mohammed AlQuraishi 


Two threads of research in the quest for methods that predict 
the 3D structures of proteins from their amino-acid sequences 
have become fully intertwined. The result is a leap forward in 
the accuracy of predictions. See p.706 


Proteins perform or catalyse nearly all 
chemical and mechanical processes in cells. 
Synthesized as linear chains of amino-acid 
residues, most proteins spontaneously 
fold into one or asmall number of favoured 
three-dimensional structures. The sequence 
of amino acids specifies a protein’s structure 
and range of motion, which in turn deter- 
mine its function. Over decades, structural 
biologists have experimentally determined 
thousands of protein structures, but the dif- 
ficulty of these studies has made the promise 
of acomputational approach for predicting 
protein structure from sequence alluring. On 
page 706, Senior et al.’ describe analgorithm, 
AlphaFold, that takes a leap forward in solv- 
ing this classic problem by bringing to bear 
modern machine-learning techniques. 

The diversity of protein structures 
precludes the possibility of obtaining simple 
folding rules, making structure prediction 
difficult. Protein folding is ultimately driven 
by quantum mechanics. Were it possible to 
compute the exact energy of protein molec- 
ules from quantum theory, and to do so for 
every possible conformation, then predict- 
ing a protein’s most energetically favoured 
structure would be easy. Unfortunately, a 
quantum treatment of proteins is compu- 
tationally intractable (quantum computers 
might change this), and the total set of possi- 
ble conformations that any protein can take is 
astronomical, prohibiting such a brute-force 
approach. 

This has not stopped scientists from 
attempting a direct attack on the problem. 
Physical chemists have devised tractable, but 
approximate, energy models for proteins’, and 
computer scientists have developed ways to 
explore protein conformations®. Much pro- 
gress has been made onthe first problem but 
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the second has proved more recalcitrant. 
The set of shapes that a protein might take 
can be likened to a landscape: different loca- 
tions inthe landscape correspond to different 
shapes, with nearby locations having similar 
shapes. The height of a location corresponds 
to howenergetically favourable the associated 
shapeis, with the lowest point being the most 
favoured. Natural proteins evolved to have 
funnel-shaped landscapes that enable newly 
synthesized proteins, jostled by the thermal 
fluctuations of the cell, to cross the landscape 
and find their way to a favoured conformation 
in physiologically relevant timescales (milli- 


“Thealgorithm 
outperformed all entrants 
at the most recent blind 
assessment of methods 
used to predict protein 
structures.’ 


seconds to minutes)*. Algorithms can search 
the landscape to find favoured conformations 
by following the landscape’s inclination, but 
the ruggedness of the terrain causes them to 
get stuck in troughs and valleys far from the 
lowest basin. 

The course of the structure-prediction 
field changed nearly a decade ago with the 
publication of a series of seminal papers>’” 
exploring the idea that the evolutionary record 
contains clues about how proteins fold. The 
idea is predicated on the following premise: 
if two amino-acid residues in a protein are 
close together in 3D space, then a mutation 
that replaces one of them witha different resi- 
due (for example, large for small) will probably 
induce, at a later time, a mutation that alters 
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the other residue ina compensatory direction 
(in our example, swapping small for large). The 
set of co-evolving residues therefore encodes 
valuable spatial information, and can be found 
by analysing the sequences of evolutionarily 
related proteins. 

By transforming this co-evolutionary 
information into a matrix known as a binary 
contact map, which encodes which residues 
are proximal, the set of conformations that 
merit consideration by algorithmic searches 
can be restricted. This in turn makes it possi- 
ble to accurately predict the most favourable 
protein conformation, especially for pro- 
teins for which many evolutionarily related 
sequences are known. The idea was not new®, 
but the rapid growth in available sequence 
data in the early 2010s, coupled with crucial 
algorithmic breakthroughs, meant that its 
time had finally come. 

Co-evolutionary analysis has been respon- 
sible for most progress in protein-structure 
prediction inthe past few years, but it has not 
obviated the need for algorithms to search the 
energy landscapes of proteins: binary contact 
maps constrain the search space, but do not 
pin downa single 3D structure. Furthermore, 
the mathematics underpinning the conversion 
of co-evolutionary data into contact maps is 
restricted by the types of input used and the 
output generated. The initial injection of deep 
learning (a type of machine learning) into 
co-evolutionary analyses improved matters by 
incorporating richer inputs’. AlphaFold takes 
things a step further by changing the outputs. 

In lieu of binary contact data, AlphaFold 
predicts the probabilities of residues being 
separated by different distances. Because 
probabilities and energies are interconverti- 
ble, AlphaFold predicts an energy landscape — 
one that overlaps in its lowest basin with the 
true landscape, but is much smoother. In fact, 
AlphaFold’s landscape is so smooth that it 
nearly eliminates the need for searching. This 
makes it possible to use a simple procedure to 
find the most favourable conformation, rather 
than the complex search algorithms employed 
by other methods. 

The idea that a complex search could be 
unnecessary for structure prediction is, in 
hindsight, unsurprising. Mathematically, the 
distances between points determine their 
relative locations. Predictions of distances 
can therefore predict structure. Moreover, 
relatively simple models of protein energy 
landscapes known as Go potentials, in 
which experimentally determined distances 
between residues are favoured, can lead to 
protein-folding pathways that resemble ones 
experienced by real proteins’’. This suggests 
that proteins fold more like simple origami 
than like an intricate knot — all parts cancome 
together at once. My own work has shown 
that folding can be predicted implicitly using 
a deep-learning model without searching", 
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Figure 1| Predictions of protein structures. Senior et al.! report a machine-learning system called 
AlphaFold, which predicts the 3D structures of proteins from their amino-acid sequences. Template 
modelling (TM) scores measure how well a predicted structure matches the overall shape of the actual 
structure, onascale from 0 to1. TM scores for AlphaFold were better than those of other prediction systems 
for 25 out of 43 proteins ina blind test. Here, the TM scores for AlphaFold (red) are compared with those 

of other prediction systems (grey) in the blind test for six proteins whose 3D structures could be modelled 
only on the basis of their amino-acid sequences — no 3D structures of proteins that have similar amino- 

acid sequences were available to use as a starting point for modelling. AlphaFold made the most accurate 
predictions for five of these six proteins. (Adapted from Fig. 1b of ref. 1.) 


and minimal search procedures have also 
been embedded within another deep-learning 
model to predict protein structures”. 

What is notable about AlphaFold is that it 
predicts distances with sufficient accuracy to 
outperform state-of-the-art search methods 
(Fig. 1). Senior et al. used advances in deep 
learning to extract as much structural infor- 
mation as possible from protein sequences. 
The resulting algorithm outperformed all 
entrants at the most recent blind assessment 
of methods used to predict protein structures 
(the CASP13 event), generating the best struc- 
ture for 25 out of 43 proteins, compared with 
3 out of 43 for the next-best method. Alpha- 
Fold’s predictions had a median accuracy of 
6.6 angstroms on this set of proteins - that 
is, for the middle-ranked protein in this set, 
the atoms inthe proposed structures were on 
average 6.6 A away from their actual positions. 

Challenges remain. AlphaFold is not yet 
accurate enough for most applications, such 
as working out the catalytic mechanisms of 
enzymes or how drugs bind to proteins (which 
both typically require 2-3 A resolution). And 
although AlphaFold’s search procedure is 
much simpler than most modern methods, 
it can still be slow, taking tens to hundreds of 
hours to make a single prediction. For appli- 
cations suchas protein design, which require 
the structures of many different protein 
sequences to be modelled, the lack of speed 
is animpediment. 

Nevertheless, this isa watershed moment 
for the field. Given continued growth in the 
number of available protein sequences, it is 
possible that the coarse structures (about 
4 Aresolution) of most proteins that consist of 
asingle folded domain will become available in 
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the next five years from structure predictions. 
Such broad availability of structural informa- 
tion might transform the life sciences, just as 
sequence information did in the preceding 
decades. This could mean that, combined 
with the rapid advances in protein-struc- 
ture determination enabled by cryo-electron 
microscopy, we are entering a golden age of 
structural biology — one that makes possiblea 
quantitative and mechanistic basis for the life 
sciences, broadly grounded infirm structural 
hypotheses. 
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Developmental biology 


Single-cell maps 
of the human heart 


Ragini Phansalkar & Kristy Red-Horse 


Three methods for gene-expression profiling have now been 
combined to produce spatially defined single-cell maps of 
developing human organs from limited sample material, 
overcoming a major hurdle in studying human development. 


People are often curious about how their 
bodies work. So, itis no surprise that single-cell 
RNA sequencing (scRNA-seq) — which has the 
power to map all the cell types in the human 
body'?— has drawn great interest from scien- 
tists and funding agencies alike. But a major 
limitation of scRNA-seq is that it cannot pro- 
vide information about where in the original 
tissue each cell was located. Writing in Cell, 
Asp et al.* demonstrate a way of overcoming 
this hurdle by combining scRNA-seq with 
other sequencing methods that retain location 
information. They use this approach to create 
a spatially defined cell atlas of the developing 
human heart. 

ScRNA-seq involves dissociation of a tissue 
into hundreds or thousands of individual 
cells, each of which is analysed to determine 
its gene-expression profile. This profile indi- 
cates the proteins and pathways that are active 
in that cell — information that computational 
methods can then use to sort thousands of cells 
into different types or states at once. However, 
the tissue-dissociation protocol breaks the link 
between single cells and their original posi- 
tions in the tissue. Without this information, 
interpretation of the data is incomplete. Asp 
et al. set out to bridge this gap by combining 
scRNA-seq with two approaches that produce 
spatially defined gene-expression maps, 
although at alower resolution than scRNA-seq. 

The first is a technique called spatial 
transcriptomics®. Thin slices of tissue are 
placed ona specially prepared microscope 
slide that has been dotted with circular 
‘patches’. Each patch contains many copies 
of anucleic-acid probe that binds to messen- 
ger RNA in the tissue sample and carries a 
sequence called a barcode. Each patch has a 
different barcode, so that a specific label is 
attached tothe mRNA inthe area of tissue sit- 
ting ontop of that patch. Whenthe mRNA from 
each region is sequenced, the barcode acts as 
arecord of the cells’ original locations in the 
tissue. A limitation of this method is that sin- 
gle-cell analysis is compromised, because gene 
sequences are pooled from approximately 


30 cells in each patch. Nonetheless, a 
non-biased description of gene-expression 
profiles at discrete locations is obtained. 
The second method is in situ sequencing 
(ISS), in which the expression of preselected 
genes is probed directly ona tissue slice ona 
microscope slide®”. DNA probes are designed 
to bind to mRNA transcribed from genes of 
interest, with each probe carrying a unique 
barcode. When these probes are introduced 
into a tissue, they bind to their target MRNAS 
in the tissue’s cells and — after further pro- 
cessing steps — a fluorescent-imaging-based 


a scRNA sequence 


Dissociate cells 


@ ®@ ® 
ees 
@ @® @® 


| Sequence each cell 


sequencing method is used to detect the 
barcodes while they remain in place in 
the tissue. This technique can provide infor- 
mation about the expression of 50-100 genes 
in individual cells at high spatial resolution. 
But the genes must be preselected, which 
requires some knowledge about which genes 
will be informative. 

Asp and colleagues ultimately used ISS to 
achieve cell-level mapping of heart develop- 
ment. But to determine which genes they 
should select for ISS, they needed to perform 
scRNA-seq and spatial transcriptomics (Fig. 1). 
The authors’ scRNA-seq analysis revealed 
genes correlating with cell type — genes 
expressed only in smooth-muscle cells, for 
instance. And their spatial transcriptomics 
analysis revealed groups of genes that broadly 
correlated with certain locations in the heart — 
genes specifically expressed in cells inaregion 
called the outflow tract, for example. 

The researchers then created a panel of 
69 ISS probes — some corresponding to the 
location markers identified by spatial tran- 
scriptomics, some to the cell-type markers 
identified by scRNA-seq, and some to genes 
previously reported to be important for heart 
development. They combined the data pro- 
duced by this ISS screen with their scRNA-seq 
data using a recently developed algorithm® 
that assigns each RNA molecule detected by 
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Figure 1| A framework to map human development. Asp et al. combine three methods to produce single-cell 
maps of the developing human heart that retain spatial information. a, In single-cell RNA sequencing (scRNA- 
seq), a heart is dissociated into single cells and the RNA of each cell is sequenced to generate a gene-expression 
profile. A computer algorithm assigns a type to each cell, clustering together those that have similar gene- 
expression profiles. However, no information is retained about the original position of each cell in the tissue. 
b, Inspatial transcriptomics, slices of heart tissue are placed on microscope slides and many small regions of 
about 30 cells are sequenced. This provides information about gene expression in specific locations, but not 
at the single-cell level. c, The authors used the results of these screens to choose a panel of 69 genes that mark 
specific cell types or locations in the heart. They used a third sequencing approach, in situ sequencing (ISS), 
to analyse expression of these genes at the single-cell level across slices of human heart tissue. An algorithm 
assigns each RNA molecule to a cell within the tissue slice, generating spatially resolved maps of cell type. 
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ISS toa specific cell nucleus, and each nucleus 
toa specific cell type. The final result was the 
first spatiotemporal cell atlas of the embry- 
onic human heart between 6.5 and 7 weeks of 
development. Anybody can explore the atlas 
using a searchable online tool (see go.nature. 
com/3rjédtf). 

Beyond a technical proof-of-principle, 
the authors also gained insights into heart 
development. For example, a previous study” 
described several distinct, but transcription- 
ally similar, clusters of fibroblast-like cells 
—astructural cell type that is not fully under- 
stood, but which is known to participate ina 
pathological tissue-scarring process called 
fibrosis. Asp et al. demonstrated that these 
fibroblast subgroups are located in distinct 
parts of the heart, providing a clue as to how 
they might function differently. 

As another example, Asp and colleagues 
described a previously unknown human equiv- 
alent of a subpopulation of cardiac muscle 
cells found in mice that expresses high levels of 
the gene Myoz2 (ref. 10). Finally, the research- 
ers performed a cellular analysis of a tissue 
called the atrioventricular mesenchyme, and 
identified the time point at which Schwann 
cells (neuron-associating cells that ensure 
proper electrical transmission) arise in this 
tissue. Such knowledge of human heart cells 
can be used to inform follow-up experiments 


aimed at defining those cells’ functions. 
However, limitations in Asp and colleagues’ 
study highlight that pre-existing knowledge is 
still required to fully appreciate single-cell data. 
For instance, some of their cell populations 
seem to be misidentified. One population is 
deemed by the authors to be cells that line capil- 
lary blood vessels (coronary endothelium). But 
on the basis of the cells’ marker-gene expres- 
sion and location, we would suggest that they 


“The techniques have 
the crucial advantage 

of providing a wealth of 
information from limited 
tissue.” 


are instead endocardial cells that line the heart 
chamber. Furthermore, the authors’ approach 
could not be used to distinguish between all of 
the subtypes or substates in particular popula- 
tions, suchas the coronary endothelium, which 
is amixed population that lines the arterial, 
venous and capillary blood vessels leading 
to and from the heart. However, the authors’ 
workflow is certain to be refined as advanced 
techniques are developed for directly profiling 
the spatial gene expression of tissues”. 

A major strength of Asp and colleagues’ 


study is that it provides a framework for 
maximizing the power of scRNA-seq. Anyone 
who begins experiments to understand human 
biology faces the challenge of limited sample 
availability, particularly when assessing embry- 
onic development. The techniques presented 
here have the crucial advantage of providing 
a wealth of information from limited tissue. 
Asp and co-workers’ strategy is likely to bene- 
fit efforts such as the Human Cell Atlas (www. 
humancellatlas.org) and HuBMAP (https:// 
hubmapconsortium.org) projects, which are 
both seeking to fully define the human body. 
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Hydrogen has been an essential element in the development of atomic, molecular and 
condensed matter physics’. It is predicted that hydrogen should have a metal state’; 


however, understanding the properties of dense hydrogen has been more complex 
than originally thought, because under extreme conditions the electrons and protons 
are strongly coupled to each other and ultimately must both be treated as quantum 
particles**. Therefore, how and when molecular solid hydrogen may transform into a 
metal is an open question. Although the quest for metal hydrogen has pushed major 
developments in modern experimental high-pressure physics, the various claims of 
its observation remain unconfirmed>”. Here a discontinuous change of the direct 
bandgap of hydrogen, from 0.6 electronvolts to below 0.1 electronvolts, is observed 
near 425 gigapascals. This result is most probably associated with the formation of the 
metallic state because the nucleus zero-point energy is larger than this lowest 
bandgap value. Pressures above 400 gigapascals are achieved with the recently 
developed toroidal diamond anvil cell®, and the structural changes and electronic 
properties of dense solid hydrogen at 80 kelvin are probed using synchrotron infrared 
absorption spectroscopy. The continuous downward shifts of the vibron wavenumber 
and the direct bandgap with increased pressure point to the stability of phase-IIl 
hydrogen up to 425 gigapascals. The present data suggest that metallization of 
hydrogen proceeds within the molecular solid, in good agreement with previous 
calculations that capture many-body electronic correlations’. 


The search for metal hydrogen has a unique place in high-pressure 
physics. Indisputably, metal hydrogen should exist. Owing to increase 
in electron kinetic energy because of quantum confinement, pres- 
sure should turn any insulator into a metal, as observed for molecular 
oxygen around 100 GPasome 20 years ago”. At first, the prediction of 
the insulator—metal transition in dense hydrogen was intertwined with 
the molecular dissociation”. However, it was later suggested that metal 
hydrogen may exist as a proton-paired metal”. Quantitative predic- 
tions of the stability domain and of the properties of metal hydrogen 
remain challenging because many contributions could be in effect 
and should be self-consistently treated?*; for example, many-body 
electronic correlations, nuclear quantum effects, nuclear spin order- 
ing, coupling between protons and electrons (as suggested by a large 
Born-Oppenheimer separation parameter), or anharmonic effects. 
The most advanced calculations, suchas diffusion Monte Carlo (DMC) 
simulations*””, now go beyond the electronic correlation mean-field 
description of density functional theory and try to capture many-body 
electronic correlations. Importantly, metal hydrogen should exhibit 
notable properties, such as room-temperature superconductivity”® », 
a melting transition at a very low temperature into a superconducting 
superfluid state’ and a mobile solid state”. 


The change in the direct bandgap of solid hydrogen was previously 
measured up to 300 GPa by visible absorption mesurements”. By 
extrapolating to zero the linear decrease of the bandgap with den- 
sity, the transition to metal hydrogen was predicted to occur around 
450 GPa. In this work, we extend the investigation of the direct band- 
gap decrease down to the near-to-mid-infrared energy range. Infra- 
red measurements provide a non-intrusive method both to disclose 
structural changes and also to characterize the electronic properties 
of hydrogen up to its metal transition. Our approach is based on two 
experimental developments. First, in order to overcome the 400 GPa 
limit of conventional diamond anvil cells”, we used the recently devel- 
oped toroidal diamond anvil cell (T-DAC)® that can achieve pressures 
of up to 600 GPa. Importantly, under extreme pressures, the T-DAC 
preserves the advantages of the standard diamond anvil cell in terms 
of stress distribution, optical access and sample size. Synthetic type- 
Ila diamond anvils were used to provide infrared transparency down 
to800cm™. Second, an infrared horizontal microscope was designed 
to be coupled toa collimated exit port of asynchrotron-feed Fourier- 
transform infrared spectrometer at the SMIS beamline at the SOLEIL 
synchrotron facility. Such a high-brightness broadband infrared source 
is essential for measuring, by transmission, satisfactory signal-to-noise 
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Fig. 1| Aselection of measurements over the investigated pressure range. 

a, Photographs of the hydrogen sample taken at different stages of 
compression, under simultaneous front and back bright-light illumination. The 
hydrogen sample is indicated by the blue arrow. Around 310 GPa, the sample 
reversibly turns black, as illustrated by the photographs taken at 315 GPa for the 
increasing pressure path and at 300 GPa for the decreasing pressure path. At 
427 GPa, the sample is inthe metallic state and is still distinguishable from the 
rhenium gasket. The red-coloured aspect at the diamond tip centre is 
attributed to the decrease of the diamond bandgap’. b, Infrared transmission 


ratio spectra through a 5-ym diameter sample. Simultaneous Raman 
spectroscopy and visual observation could be performed. 

Figure 1illustrates selected data obtained for the hydrogen sample 
inthe T-DAC at 80 K during different compression and decompression 
stages. Four photographs (Fig. 1a) show the changes in the sample 
appearance. The formation of black hydrogen-that is, the transfor- 
mation of hydrogen from transparent to totally opaque in the visible 
range—is observed around 310 GPa, as previously reported’, and this 
was reversible upon pressure release. The visual transformation of the 
sample at the probable insulator-metal transition is less contrasted. 
The observed metal-hydrogen sample is not highly reflective, as it 
appears darker than the surrounding rhenium gasket. As discussed 
below, this is consistent with the formation of a molecular metal instead 
of an atomic metal. The infrared signal was collected over the 800- 
8,000 cm™ wavenumber range; several raw spectra are shown in Fig. 1b. 
Up to 360 GPa, the signal intensity decreased owing to the shrinkage 
of the hydrogen sample size and the deformation of the toroidal anvil 
tip®. However, after intensity normalization, from 123 GPa to 360 GPa 
all spectra display the same shape when the hydrogen vibron peak is 
discarded (see Extended Data Fig. 1). Consequently, the variation of 
the infrared transmittance of the diamond anvil itself should remain 
negligible up to the 400 GPa range. Therefore changes in the infrared 
spectra can only be due to intrinsic properties of hydrogen. In Fig. 1a, 
two interesting features are clearly seen: (1) the strong absorption peak 
around 4,000 cm tis associated with the H, vibron that appears above 
160 GPa upon the solid entering phase Ill, as reported previously”°. This 
vibron mode broadens and shifts to lower wavenumbers with increasing 
pressure; (2) above 360 GPa, the shape of the infrared spectra display 
zeroing at high wavenumbers, evolving towards low values with pres- 
sure (see Extended Data Fig. 2), which indicates the decrease of the 
hydrogen direct bandgap in the infrared range. Importantly, a very 
discernible Raman diamond edge (see inset Fig. 1c), used as the pres- 
sure gauge, could be measured up to the maximum pressure and upon 
release, as a result of an elastic deformation at the diamond anvil tip 
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facilitated by the toroidal shape. In Fig. Ic, the evolution of the sample 
pressure versus the force on the piston features the expected trend®”. 

In Fig. 2a, absorbance spectra have been obtained by taking a direct 
ratio of the spectrum ata given pressure to that taken at 123 GPa (after 
intensity normalization). For infrared measurements of semiconduc- 
tors under pressure, the direct excitonic level (in the case of hydro- 
gen, the values of the excitonic and of the direct bandgap should be 
almost identical)”is positioned at the junction between the absorbance 
plateau and the lower energy tail, as done previously to position the 
hydrogen bandgap in the visible range”’. In the present experimental 
configuration, a maximum absorbance value of just 2 could be meas- 
ured. Hence, alower bound for the bandgap should probably be inferred 
because the absorbance plateau might be at a higher value. However, 
because the hydrogen sample was about 1.6 (+ 0.1) pm thick, the absorp- 
tion coefficient associated with an absorbance of 2 is estimated to be 
about 28,000 cm“, whichis similar to the value obtained from the direct 
bandgap measurements in the visible range’. The bandgap underesti- 
mation should be smaller than 0.14 eV. Around 425 GPa, atransitiontoa 
total infrared absorption is observed, corresponding to an absorption 
coefficient greater than 25,000 cm’ over the whole infrared spectral 
range investigated. This is anecessary condition for the infrared obser- 
vation of metal hydrogen but not definitive evidence, because the exist- 
ence of a direct bandgap less than 0.1 eV—that is, below the 800 cm™ 
lower limit of the covered infrared spectral range—cannot be ruled out, 
although that seems unlikely because the nucleus zero-point energy is 
greater than this value. In Fig. 2c, the discontinuity of the transition is 
evidenced by the pressure evolution of the integrated infrared intensity 
over the 800-2,000 cm‘ wavenumber range. Upon pressure release, 
the infrared spectral intensity and shape are reversibly recovered (see 
Extended Data Fig. 2). The C2/c structure with 24 atoms per unit cell, 
henceforth C2/c-24, has been calculated to be the most probable can- 
didate in the pressure range of the present measurements™. If so, from 
electronic band structure calculations”, an indirect bandgap should 
close under pressure before the direct bandgap does. Consistent with 
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Fig. 2| Discontinuous pressure evolution in the infrared absorption and 
probable signature of metal hydrogen. a, Absorption spectra of hydrogen at 
different pressures. Above 386 GPa, the direct electronic bandgap is indicated 
by arrows. For clarity, different colours are associated with different pressures. 
b, Transmission spectra over the infrared range 800-2,000 cm”. The pressure 


this picture, hydrogen was recently reported” to start conducting 
above 350 GPa (or 330 GPa, if the diamond pressure scale used here is 
applied). Then up to 440 GPa (or 395 GPa in our case), the value of the 
resistivity (about 5 x 10* O m), and its temperature dependence sug- 
gest a semimetal state with alow concentration of charge carriers. The 
associated plasma edge should be below 0.1 eV, which is why it could not 
be detected here. For comparison, the plasma edge of the low-density 
charge carriers conducting xenon could only be measured using the 
present synchrotron infrared setup™ just above the xenon metallization 
pressure when the resistivity falls below 1 x 10 O m (ref. 7). 
Properties of solid hydrogen up to the metal transition are shown 
in Fig. 3. In Fig. 3a, the linear pressure shift of the vibron wavenumber 
from 160 GPa to 425 GPa indicates that no structural change occurs up 
to 425 GPaand that the solid hydrogen remains in phase Ill. The C2/c-24 
candidate structure for phase III was first proposed using an ab initio 
random structure searching method”. It consists of layers of molecules 
forming a slight monoclinic distortion of the hexagonal lattice. The 
C2/c-24 structure has a specific infrared fingerprint”, exhibiting one 
intense vibron infrared mode as well as one intense phonon infrared 
mode, in good agreement with the present observation. The phonon 
modeis, to our knowledge, reported here for the first time but could 
be observed only up to 225 GPa, above which it becomes hidden by the 
strong absorption of the diamond anvils. The vibron wavenumber shift 
was reversibly observed upon pressure decrease (see Extended Data 
Fig. 3). Finally, the calculated” pressure evolutions of the vibron and 
of the phonon infrared wavenumbers for the C2/c-24 structure are in 
very good agreement with the present experimental data. In Fig. 3b, 
the direct bandgap is seen to decrease linearly with pressure, and is 
well matched with the previous measurements in the visible range’®. 


Pressure (GPa) 


colourscale is asina.c, Integrated transmitted intensity over the 800-2,000 cm 
infrared range for increasing (red) and decreasing (blue) pressure. Pressure 
uncertainty is +10 GPa. Errors are the random uncertainty estimated from 
different measurements at the same pressure, for typically three 
measurements. The dashed lines inb, care guides to the eye. 


In calculations, the bandgap is profoundly affected by the level of the 
description of electronic exchange-correlation and also by nuclear 
quantum effects. Previous work that has used local exchange-correla- 
tion density functional theory, for example using the Perdew-Burke- 
Ernzerhof (PBE) calculation” or implementing vdW-DF2”6, has obtained 
unreliable bandgaps, and has usually underestimated its value. The 
more advanced methods of quasi-particle computational approaches, 
GW” and DMC’, should be more reliable. In Fig. 3b, they are seen to give 
higher bandgap values than obtained by experiment. Accounting for 
nuclear quantum effects should lower the bandgap energy” by at least 
leV and therefore a better agreement with the present data should 
be obtained. It is interesting to note that there is a confluence of the 
bandgap and of the vibron energy values, both about 0.5 eV, when the 
transition to the probable metal state occurs. 

Currently, there are five experimentally described phases of hydro- 
gen*”’,. Recently, a single crystal X-ray diffraction study”* at 300 K up 
to 254 GPa showed that the I, III and IV phases are isostructural, and 
the hydrogen molecules remain in the hexagonal close-packed crystal 
lattice structure. Phases IV and V exist only above 200 K (refs. 2”). An 
updated low-temperature phase diagram of solid hydrogen is shown in 
Fig. 4. In phase I, the molecules are in a quantum free-rotational state 
and arranged ona hexagonal close-packed lattice. Upon transitioning 
to phase Il, very small discontinuities in the lattice parameters have 
been measured”’. Quantum molecular rotations become restricted and 
phase II can be described as a quantum fluxional solid*°. At 160 GPa, 
phase II transforms into an ordered molecular phase III. We suggest here 
that phase III has the C2/c-24 structure up to 425 GPa, above whicha 
discontinuous transition to metal hydrogen occurs. It should be noted 
that the infrared spectrain phase II at 300 K and at 100 K are reasonably 
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Fig. 3 | Evolution of the molecular solid hydrogen properties up tothe 
insulator—-metal transition: comparison between experiment and 
calculations. a, Wavenumbers of the vibron and of the phonon infrared active 
modes in phase Ill versus pressure. The black dots and the red lines represent 
experimental data and linear fits. The blue squares and the green triangles are 
the density functional perturbation theory” and first-principles molecular 
dynamics” calculations for the C2/c-24 structure, respectively. Inset, the 
infrared absorption spectrum at 190 GPa, showing the vibron and the phonon 
peaks. Comparison of the present data with previous experimental 
determinations of the infrared H, vibron wavenumber versus pressure is shown 
in Extended Data Fig. 4. b, The pressure evolution of the experimental bandgap 
(combining data in the visible range’ and the present infrared data; black 
symbols), iscompared to calculations for the C2/c-24 structure performed 
under a variety of approximations: within the DFT framework with local PBE 
(orange squares) and nonlocal vdW-DF2 (green squares) exchange-correlation 
functionals”®; with the quasiparticle approach within the GW approximation? 
(orange triangles); and with the DMC method” (blue triangles). The vertical 
dashed line indicates the transition to metal hydrogen. The uncertainty onthe 
bandgap is 0.14 eV, estimated from Fig. 2a by linearly extending the rising 
absorbance before the plateau of 2 to an absorbance value of 3. The dashed 
lines that follow the data are guides to the eye. Evolution of the direct bandgap 
versus density, using a revisited hydrogen compression curve to convert 
pressure into density, is shown in Extended Data Fig. 5. Vibronand bandgap 
data are presented in Extended Data Table 1. 


different and are associated with a substantial temperature shift of the 
vibron wavenumber". The orientational ordering of the molecules 
could be increased by lowering the temperature, hence inducing the 
monoclinic distortion from the hexagonal close-packed lattice at 300 K 
to form the C2/c-24 structure at 80 K. In the framework of PBE density 
functional theory, the C2/c-24 structure should be stable only up to 
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Fig. 4| Low-temperature phase diagram of solid hydrogen. The red dashed 
line is the pathway of the infrared data collection and the red triangle indicates 
the transition to metal hydrogen. Boundary lines between phases I, Iland Ill are 
from previous studies (as reviewed in ref. *). The sequence of phase transitions, 
determined by DMC calculations including nuclear quantum effects’, is 
displayed at the top of the graph. 


270 GPa". Using DMC calculations and including nuclear quantum 
effects, a transition from an insulating C2/c-24 structure to a metallic 
Cmca-12 structure is obtained at 424 GPa’, in good agreement with the 
pressure at which we observe total infrared absorption above 800 cm". 
That structural transition should be displacive, implying an orientation 
ordering in the layers of the H, molecules from nearly parallel to flat, 
which should induce a larger distortion from hexagonal packing. That 
could take place with almost no pressure hysteresis. Furthermore, the 
reflectivity in the visible range for the Cmca-12 molecular metal has been 
calculated* to be about 0.5, and so this molecular metal should appear 
darker than the rhenium gasket. Consequently, the full infrared absorp- 
tion, the small hysteresis of the transition and the photograph of the 
sample above 425 GPa all suggest that we have observed the hydrogen 
insulator-to-metal transition inthe molecular crystal, associated with 
astructural transition from C2/c-24 to Cmca-12. Following DMC calcu- 
lations’, atomic metal hydrogen should be observed above 447 GPa. 

More measurements are now needed to definitively prove the hydro- 
gen transition toa metallic state. It seems particularly appropriate to 
try to observe the predicted high-temperature superconductivity of 
metal hydrogen in both the molecular and atomic hydrogen-metal 
phases, at 250 K (ref. “) and 350 K (ref. *), respectively. For doing so, 
anon-invasive infrared reflectivity measurement has recently been 
proposed*, and the sample size reachable using the T-DAC willimprove 
our ability to make such a measurement. 
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Methods 


The toroidal diamond anvil cell 

The toroidal shape of the synthetic single-crystal diamond anvil tip 
was prepared by focused ion beam machining. Scanning electron 
micrograph and profile of the toroidal tip are given in Extended Data 
Fig. 6. The central flat diameter, groove and depth are 25 pm, 80 pm 
and 4.6 pm, respectively. The toroidal tip has been recovered intact 
upon full pressure release, indicating its purely elastic deformation 
up tothe highest pressure reached. That indicates that no irreversible 
transformation of the anvil at tip in contact with the hydrogen sample 
has perturbed the absorption measurements. Breakage on the bevelled 
slope of the anvil is observed, as in standard DAC, but outside of the 
focused ion beam-machined central part. 

High pressures were generated using the T-DAC, consisting of a 
LeToullec membrane diamond anvil cell equipped with Boehler-Almax 
type seats of polycrystalline diamond and equipped with toroid-shaped 
diamond anvils. The hydrogen sample was loaded in the T-DAC under 
a pressure of 140 MPa. A focusedion beam-drilled rhenium gasket was 
used. The sample pressure was slowly increased to enable the gradual 
elastic deformation of the anvil tip by changing the membrane pres- 
sure pushing the piston with a rate of 0.2 bar min”. The red colour 
of the diamond tip is gradually observed above 350 GPa, reversibly 
disappearing upon pressure release, owing to the diamond bandgap 
closing within the visible range at the diamond tip®**. 

The sample thickness, 1.6 + 0.1 1m, was estimated by the conserva- 
tion of the hydrogen mass between loading (33.6 cm? mol!” of hydro- 
gen loaded ina gasket hole 14 pm in diameter and 6 pm thick) and the 
sample at 400 GPa (1.6 cm’ mol ‘ina diameter of about 5.8 + 0.2 um). 

The conversion between the membrane pressure, P,,,, and the force 
onthe piston, F, is F[KN] = 0.05  P,, [bar]. 


Pressure measurement 

The high-frequency step of the T,, Raman spectra of the stressed dia- 
mond anvil was used to measure the pressure at the hydrogen sample. 
The hydrogen sample pressure is related to the diamond-edge wave- 
number by the Akahama calibration®. The revision” of this calibration 
curve up to the 400 GPa pressure range was discarded because the 
pressure appeared to be overestimated, on the basis of the following: 
(1) in our previous measurements with toroidal anvils using X-ray dif- 
fraction®, the rhenium pressure gauge gave values in better agreement 
with the original pressure scale; (2) as seen in Extended Data Fig. 7, the 
pressure evolution versus the membrane pressure and the infrared 
vibron wavenumber versus pressure exhibit unphysical behaviours 
when using the revised scale; and (3) the revised scale is based on the 
Pt equation of state* that has been recently shown to overestimate 
pressure’®. The pressure of the bandgap data measured previously in 
the visible range’’ has been corrected using the same pressure calibra- 
tion as in the present study (—20 GPa). No difference up to a pressure 
of 330 GPa, estimated with the diamond-edge Raman pressure gauge, 
could be observed between operating the toroidal anvils or the stand- 
ard bevelled anvils, by looking at the infrared H, vibron wavenumber 
versus pressure (see Extended Data Fig. 3). The error bars in the pres- 
sure measurements, +10 GPa, arise from the random uncertainties 
originating from the positional accuracy of the sample and the stress 
field at the tip. The systematic uncertainties owing to the pressure 
calibration scale are not taken into account. The estimated pressures 
may be corrected in the future if the diamond-edge pressure gauge 
calibration is refined. The pressure scale used here is: 


PIGPa] = 54742 a a, 3755 | 
Wo Wo 


where Aq is the frequency shift of the diamond edge and w, = 
1,334cm". 


Infrared bench 

A photograph of the bench is shown in Extended Data Fig. 8. This bench 
was previously used to characterize the infrared vibrational modes of 
phase IV of hydrogen at 300 K™. The custom-made horizontal infrared 
microscope is equipped with two infinity-corrected long-working- 
distance Schwarzschild objectives (working distance, 47 mm; numerical 
aperture, 0.5; magnification, 15x) that produce a 23 um (full width at 
half maximum, FWHM) infrared spot at a wavelength of 10 pm. This 
beam size is reduced inside the T-DAC owing to the effect of the dia- 
mond refractive index. The spatial and temporal stability of the broad- 
band infrared beam enabled us to record transmission spectra, with 
a good signal-to-noise ratio over the range 800-8,000 cm fora hole 
of diameter 6 pm (see Extended Data Fig. 8). One of the Schwarzschild 
objectives is mounted ona300-mmtranslation stage to free up space 
behind the cryostat in order to insert the optical head for Raman spec- 
troscopy measurements without moving the T-DAC. The Raman signal 
was excited by a 660-nm-wavelength laser limited to 3 mW power out- 
put above 300 GPa to prevent thermal heating and hence breakage of 
the toroidal tip. The Raman head is also equipped with a digital camera 
to take photographs of the sample at each pressure. The quality of the 
measurements obtained also relies on the high mechanical stability of 
the bench and onthe high reproducibility in position upon swapping 
between the infrared and the Raman configurations. Infrared spectra 
were collected witha4 cm ‘resolution and 1,024 scans, with an electron 
beam current of 450 mA (top-up mode). 


Absorption spectra 

All spectra were recorded using the transmission geometry and then 
divided by a reference transmission spectrum, here taken at 123 GPa. 
Such a pressure is high enough to avoid any Fabry-Perot interference 
signal coming from the two parallel diamond interfaces of high refrac- 
tive index enclosing the sample, but with the absence of the strong H, 
vibron absorption which appears above 160 GPa in phase Ill. The overall 
intensity (peak-to-peak value of the interferogram) of the spectra has 
been normalized by that at 310 GPa to take into account the change in 
the hydrogen sample diameter owing to its compression. It was observed 
that above 310 GPa, the sample size is almost constant upon pressure 
increase and decrease. The absorbance is defined as A =-log,,(/(v)/I,(v)), 
where _/is the intensity of the raw spectrum at a frequency v. 

Inthe present experimental configuration, the maximum absorbance 
value that could be reasonably measured is approximately 2, indicating 
that less than 1% of the reference spectrum signal is detected. Con- 
sidering the small gasket hole, the detection of the signal remained 
challenging and required using the detector gain at maximum value, 
inducing an increase of the intrinsic detector noise. 


Data availability 


The data that support the findings of this study are available from the 
corresponding author upon request. 
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Extended Data Fig. 1| Infrared spectra below 350 GPa. a, Single-beam 
spectrum at 123 GPa, used as the reference spectrum for the absorption 
spectra. The three red stars indicate parasitic effects corresponding to, from 
right to left, absorption peaks of impurities around 2,800 cm", abroad 


absorption band from the diamond (1,900-2,300 cm”), anda broad absorption 
around 1,200 cm from the protected layers of the aluminium mirrors of the 
beamline. b, Single-beam spectra, after intensity normalization (peak-to-peak 
value of their respective interferogram before Fourier transform). IR, infrared. 
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Extended Data Fig. 2 | Infrared spectra above 350 GPa with pressure increase and pressure decrease. a, b, These spectra show a zeroing at high wavenumbers, 
progressing towards low wavenumber values with pressure increase (a) and reversibly, to the opposite direction upon pressure release (b). 
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Extended Data Fig. 3| Infrared vibron and phonon wavenumbers versus 
pressure. a, The symbols indicate: red, pressure increase; blue, pressure 
decrease; and black, previous work performed using standard diamond anvil 
cells. The filled dots indicate the pressure measured by the diamond pressure 
scale and the opensymbolsare the position according to the linear vibron shift as 
afunction of pressure. Inset, infrared spectrum at 406 GPa, in arbitrary 
absorbance units, with the vibron peak indicated bya red star. The linear fit of the 
vibron wavenumber with pressure is given by: Uyipron [CM] = 4,814 -2.48P [GPa] 
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Extended Data Fig. 4 | Infrared vibron wavenumber versus pressure. The present data (black dots), and their linear fit (red dashed line) are compared to the 


linear fits of previous measurements” up to 360 GPa. Pressure uncertainty is +10 GPa. 
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Extended Data Fig. 5 | Hydrogen equation of state and evolution of the direct 
electronic bandgap versus density. a, Equation of state of solid hydrogen 
around 80K. The black dots are neutron diffraction measurements”. The blue 
dots are our unpublished X-ray diffraction data obtained at the European 
Synchrotron Radiation Facility; the pressure scale is based on the revised ruby 
scale*’. The red line is the fit of the experimental data by a Vinet form”: 
P=3K)(1—X)Xexp[3/2(Ko’ — 1)(1-X)], with X= (V/Vp)"2, Ko = 0.191 GPa, Ko’ = 7.039 
(where K,’ = dK,/dP), and V, =23 cm? mol". The present equation of state is in 
good agreement with that measured previously”. Inset, the Vinet form canbe 
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reformulated in terms of expressions analogous to normalized stress, 

In[H(X)] = In[PX?/3(1-X)], and Eulerian strain, (1—X). This gives: In[H(X)] = Ink, + 
3/2(K,y’ -1)(1-X). The linear fit of the data is shown. b, Evolution of the direct 
bandgap of solid hydrogen with density, for pressure increase (red), pressure 
decrease (blue) and froma previous study in the visible range’* (green). The 
vertical rectangles indicate the maximum 0.14-eV underestimation of the 
bandgap owing to the limited absorbance value of 2 that could be 

measured. The density uncertainties (t0.007 H, molecm*®) are obtained by 
propagating the +10 GPa pressure uncertainties. 


Extended Data Fig. 6| Geometry of the toroidal shape. a, Scanning electron recovered after pressure release. The toroidal part of the anvil is intact. Ring 
micrograph of the anvil. b, Profile of the toroidal diamond tip measured by cracks are seen on the bevel of the anvil, at a diameter of about 150 um. 
interferometry. c, Scanning electron microscope image of the toroidal anvil 
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Extended Data Fig. 7 | Comparison of the original® and revised** Akahama 
diamond pressure scales for two measurements. The two scales deviate 
above 300 GPa. a, Sample pressure-load curve. The force on the piston of the 
T-DAC is linearly related to the helium gas pressure inflating the membrane, 
F[kN]=0.05 x P,, [bar]. The revised scale (orange) gives aconvex evolution 
above 300 GPa that is mechanically incorrect. b, Infrared H, vibron 
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wavenumber versus pressure. Above 300 GPa, using the original scale (red), the 
shift is linear, in agreement with the calculated trend in phase III’?>; however, 
using the revised scale (dots and orange line), asublinear shift is observed. The 
error bars inthe pressure measurements (+10 GPa) arise fromthe random 
uncertainties originating from the positional accuracy of the sample and the 
stress field at the diamond anvil tip. 
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Extended Data Fig. 8 | Experimental setup on the horizontal infrared 
microscope at the SMIS beamline of the SOLEIL synchrotron. a, The L-N, 
flow cryostat containing the T-DAC sits between the two Schwarzschild 
objectives for infrared transmission measurements. Inthe Raman 
configuration, one of the Schwarzschild objectives is swapped by the Raman 
head. Both configurations are reproducibly recovered within about 2 um, 


because they are mounted ona precise, long-travel translation stage. b, Raw 
spectra recorded through two calibrated gasket holes 6 pm and 3 pmin 
diameter, made by focused ion beam machining ina rhenium gasket, and 
positioned between the two diamond anvils of the T-DAC. The spectra were 
recorded witha resolution of 4cm “and after 400 accumulations. 
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Extended Data Table 1 | Infrared vibron wavenumber, direct bandgap and corresponding pressure 
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The pressure uncertainty is +10 GPa, the bandgap uncertainty is +0.14 eV and the vibron wavenumber uncertainty is +4 cm". Top, increasing pressure; bottom, decreasing pressure. 
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Dualities are mathematical mappings that reveal links between apparently unrelated 
systems in virtually every branch of physics’ ®. Systems mapped onto themselves by a 


duality transformation are called self-dual and exhibit remarkable properties, as 
exemplified by the scale invariance of an Ising magnet at the critical point. Here we 
show how dualities can enhance the symmetries of a dynamical matrix (or 
Hamiltonian), enabling the design of metamaterials with emergent properties that 
escape a standard group theory analysis. As an illustration, we consider twisted 


kagome lattices’ * 


, reconfigurable mechanical structures that change shape by 


means of acollapse mechanism’. We observe that pairs of distinct configurations 
along the mechanism exhibit the same vibrational spectrum and related elastic 
moduli. We show that these puzzling properties arise from a duality between pairs of 
configurations on either side of a mechanical critical point. The critical point 
corresponds to a self-dual structure with isotropic elasticity even in the absence of 
spatial symmetries and a twofold-degenerate spectrum over the entire Brillouin zone. 


The spectral degeneracy originates froma version of Kramers’ theorem 


16.17 in which 


fermionic time-reversal invariance is replaced by a hidden symmetry emerging at the 
self-dual point. The normal modes of the self-dual systems exhibit non-Abelian 
geometric phases” that affect the semiclassical propagation of wavepackets”, 
leading to non-commuting mechanical responses. Our results hold promise for 
holonomic computation” and mechanical spintronics by allowing on-the-fly 
manipulation of synthetic spins carried by phonons. 


Symmetries and their breaking govern natural phenomena from fun- 
damental particles to molecular vibrations”. They are also powerful 
tools to design synthetic materials from chemical compounds to meta- 
materials”, Here we start by asking a question almost deceptive in its 
simplicity: what is a symmetry? A symmetry is a transformation that 
maps a system onto itself, as illustrated by the threefold rotation C, 
acting on the structure in Fig. 1a. A duality, on the other hand, relates 
distinct models or structures’ *. A celebrated example is the Kramers- 
Wannier order-disorder duality!” between the low- and high-temper- 
ature phases of the two-dimensional Ising model, pictured in Fig. 1b. In 
self-dual systems, the distinction between dualities and symmetries is 
blurred: additional symmetries can emerge at a self-dual point even if 
the spatial symmetries are unchanged. This is what occurs in the critical 
configuration of the mechanical system shown in the middle panel of 
Fig. 1c. Here we show howsuch dualities can be harnessed to engineer 
material properties from wave propagation to static responses that are 
not predicted by a standard symmetry analysis based on space groups. 

Mechanical structures are described at the linear level by normal 
modes of vibration and their oscillation frequencies. Both are deter- 
mined by the dynamical matrix D, which summarizes the Newton equa- 
tions of motion in the harmonic approximation 07|) =-D|p). The 
vector |) has components %,= Uy, /My, where u, is the displacement 
of particle p with mass m, from its equilibrium position. The eigenvec- 
tors |p;) and eigenvalues w? of the dynamical matrix, such that 
D\o,) = w;|Q,) are the normal modes of vibration and the corresponding 


angular frequencies. In a spatially periodic system, the spectrum of 
the Bloch dynamical matrix D(k) is organized in frequency bands with 
dispersion relations wk) parameterized by quasi-momenta k forming 
the Brillouin zone of the crystal. Although our discussion is focused 
on mechanics, our analysis applies when D is replaced by other linear 
operators, such as the Maxwell operator of a photonic crystal’’, the 
mean-field Hamiltonian of a quantum system (in which case the eigen- 
values are energies) or the dynamical matrix of an electrical circuit?” *. 

Twisted kagome lattices are a family of structures obtained froma 
mechanical kagome lattice’ » by actuating a mechanism, called a 
Guest-Hutchinson mode’, that allows a global deformation of the unit 
cells (see Supplementary Video demonstrating this property). This 
family is parameterized by a twisting angle 0 described in Fig. 2. We 
denote by 1(0) the dynamical matrix of the structure with twisting 
angle 0. To each twisted kagome lattice with a twisting angle 6 corre- 
sponds a dual mechanical structure, whichis another twisted kagome 
lattice with a twisting angle 6* = 20. — 8. Comparison of the lower pan- 
els of Fig. 2b, d reveals that two lattices related by a duality transforma- 
tion share the same band structure despite their clear structural 
difference. Remarkably, there is a self-dual kagome structure with angle 
6° = 6,+ 1/4, where the band structure is doubly degenerate over the 
entire Brillouin zone, as shown in the lower panel of Fig. 2c. We now 
prove that the explanation of these phenomenological observations 
can be traced to the existence of a mathematical duality between the 
dynamical matrices of pairs of kagome lattices. An application of our 
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Fig. 1| Symmetries and dualities. a, A star-shaped polygon in LEGO bricks 
illustrates threefold rotation symmetry: the mechanical molecule is mapped to 
itself by a 21/3 (120°) rotation C,. This is in contrast to a duality, which 
generically maps one system to another system. b, Inthe Ising model, spins ona 
two-dimensional lattice can take two values, +1, represented by black (white) 
pixels. A phase transition separates an ordered ferromagnetic phase at low 
temperature in which spins align (right panel) froma disordered paramagnetic 
phase at high temperature (left panel). The Kramers and Wannier duality! 
associates to each (inverse) temperature Z a dual temperature f*, and the ratio 
of the partition functions at B and B* isa known smooth function. The self-dual 
point f. =f corresponds to the critical phase (middle panel), where the phase 
transition between the ferromagnet and the paramagnet occurs. c, Twisted 
kagome lattices forma family of mechanical structures parameterized bya 
variable 6 called the twisting angle. (See Fig. 2 for amodel in LEGO bricks.) To 
each kagome lattice with angle 6 is associated a dual kagome lattice with angle 
6* =20.- 6, resulting in strong relations between their mechanical properties. 
At the self-dual point, where 6, = 67 = 7/4, the duality becomes asymmetry. 


general approach to electrical circuits is outlined inthe Supplementary 
Information. 

We first introduce a unitary transformation “acting on the vibra- 
tional degrees of freedom of atwisted kagome lattice as represented 
in Fig. 3. A direct calculation (see Supplementary Information) 
shows that 


U(k)D(0", - Uk) = D(O, k) (1) 
where //(k) is the Bloch representation of operator tH. Hence, should 


be viewed as a linear map between different spaces, describing the 
vibrations of the different mechanical structures with twisting angles 


M K K T M K 


M K K r 


Fig. 2| Twisted kagome lattices and their band structures. a, ALEGO bricks 
realization of the twisted kagome lattice tuned close to the critical point 6,. 
Lower inset: visualization of the twisting angle 0. The angle between two 
triangles is m— 20. Upper inset: unit cell of the mechanical structure. There are 
three inequivalent points (that is, not related by Bravais lattice translations) 
labelled M,, M,and M;. b-d, Band structures of the mechanical structures at 
different twisting angles. The physical frequencies are non-dimensionalized by 
acharacteristic frequency, @)=./(kp/m,), where k, and m, are the characteristic 
spring constant and the mass, respectively. The dual twisted kagome lattices 
with twisting angles 0. + A@ have the same band structure (band d). The self- 
dual lattice with twisting angle 6, (c) has an twofold-degenerate band structure 
(including for points outside high-symmetry lines). AttheI point, a double 
Dirac cone can be observed, highlighted by a blue shaded circle. The band 
structures are obtained by diagonalizing the Bloch dynamical matrices D(6, k). 
The masses m,of points M,are set to unity in units of mo. See Supplementary 
Information for details and a video demonstrating the collapse mechanism. 


6* and 8 (compare the two lattices in Fig. 3). Note that 2(k) does not 
depend on the twisting angle @. As Newton equations are real-valued, 
the Bloch dynamical matrix satisfies OD(0, k)O = D(@, -k) where O is 
complex conjugation. Hence, by combining the anti-unitary operator 
OwithU/, we get an anti-unitary operator A(k) =U(k)O, which squares 
to A(k)? =— Id (where ld is the identity matrix) and such that 


A(k)D(0", k) A (k) = D(8, k) (2) 
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Fig. 3 | Schematic action of the duality operator. The duality operator maps 
the vibrational degrees of freedom of atwisted kagome lattice to the 
vibrational degrees of freedom of the dual kagome lattice. The vibrational 
degrees of freedom (in blue, red and green) in aunit cell (bold outline) are 
rotated by 90° counterclockwise and translated to another unit cell. 
Importantly, the translation depends onthe degree of freedom: the vibrations 
of mass M, (in blue) are not shifted, whereas the vibrations of mass M, (in red) 
are shifted by one lattice vector a, and the vibrations of mass M; (in green) are 
shifted by another lattice vector —a,. The operator // is written asa block 
matrix; the different blocks describe the different masses in the unit cell (as 
represented by the colours), and the (real) matrices r, = io, (the square 


Equation (2) is the expression of a duality between the two lattices 
with twisting angles 6 and @, illustrated in Fig. 1c. The dynamical matri- 
ces of the two dual systems are related by aanti-unitary transformation. 
As aconsequence, they have identical band structures (more precisely, 
the eigenvalues are related by complex conjugation, and are equal 
because they are also real) and the eigenvectors are related by A. Equa- 
tion (1) is also a duality between the same lattices. In contrast to equa- 
tion (2), itis ruled by a unitary operator, butis non-localin momentum 
space (it relates kto —k). Alone, it would ensure that the band structures 
of both lattices are the same only up to an inversion of momentum. 

We now move onto explain the global twofold degeneracy observed 
in Fig. 2c. This degeneracy is reminiscent of a celebrated theorem from 
Kramers” stating that the energy states of time-reversal invariant 
quantum systems with half-integer spin are at least doubly degenerate. 
At first sight, this theorem does not apply here, as the mechanical 
degrees of freedom are neither quantum mechanical nor fermionic. 
Yet Kramers’ theorem canstill formally apply provided that some anti- 
unitary operator squaring to minus the identity matrix commutes with 
the dynamical matrix. At the critical twisting angle 6, = 02 = 1/4, the 
mechanical structure is self-dual. The duality shown in equation (2) 
acts as a hidden symmetry of the critical dynamical matrix D(@,), 
through A(k)D(0,, k) A (k) = D(6,, k). As A =- Id, Kramers’ theorem 
can indeed be applied, and implies that the band structure is twofold 
degenerate at every point k of the Brillouin zone, as observed in Fig. 2c. 
Interestingly, Aacts in the same way as the combination of spatial inver- 
sion and aso-called fermionic time-reversal would in an electronic 
system, although neither is present in our mechanical system. Owing 
to the presence of the self-dual symmetry, the critical band structure 
exhibits exotic features. For instance, a finite-frequency linear disper- 
sion (a double Dirac cone) is observed at the centre of the Brillouin 
zone (called I; see Fig. 2c) that is uncommon in systems with time- 
reversal invariance (see Supplementary Information for a discussion 
and references). 
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represents fourfold rotation onthe degrees of freedom) act for each mass on 
the two orthogonal vibrations along x and y, mapping (u,, u,) to (u,, -u,). The 
operator 7 acts on the Bravais lattice as space inversion, but does not modify 
the internal degrees of freedom (see Supplementary Information). Iterated 
applications of (/ show that if =-1d,0 =-tandét*=1d, showing that the 
symmetry has order 4. In the self-dual lattice, the transformation resembles a 
non-symmorphic symmetry composed of a 90° rotation followed by anon- 
integer lattice translation at first sight. However, further inspection shows that 
this operation is different from the duality operation, and is not asymmetry of 
the self-dual lattice (see Supplementary Information for a visual proof). 


When the self-duality is combined with the usual crystal symme- 
tries, anomalous point groups can be realized. Consider paving the 
two-dimensional plane witha single regular polygon. This is possible 
with a triangle, a square or a hexagon, but not with a pentagon ora 
dodecagon. This is a manifestation of the crystallographic restriction 
theorem: the only point group symmetries compatible with lattice 
translations are of order 1, 2, 3, 4 or 6, in two dimensions. (The order 
of an operation gis the smallest integer n such that g” is the iden- 
tity matrix.) The point group C,, of twisted kagome lattices at the 
centreI of the Brillouin zone contains threefold rotations (as visible 
in Fig. 2), perfectly compatible with this assertion. At the critical angle 
6., the duality relation (1) turns into an additional symmetry of the 
dynamical matrix. The point group at I has effectively to be supple- 
mented with 21), which has order 4 (see Fig. 3). Combined with a 
threefold rotation from C,,, the self-dual symmetry 7(T) produces an 
anomalous symmetry of order 3 x 4 = 12, making the effective point 
group atl non-crystallographic (isomorphic to D,,; see Supplemen- 
tary Information). The emergence of this non-crystallographic point 
group is curious, as the twisted kagome lattices are indeed crystals, 
not quasicrystals. However, there is no contradiction with the crystal- 
lographic restriction theorem, because the self-dual symmetry is not 
a spatial symmetry. 

Besides vibrational modes, the duality induces a relation between 
the elastic tensors c,,-(0)and c;,,(6*) that describe the static mechan- 
ical response of dual structures in the linear regime (see Supplementary 
Information). A striking manifestation of the enhanced symmetry at 
the self-dual point is revealed by considering the elastic tensors of a 
twisted kagome lattice in which inequivalent springs have different 
stiffnesses. In this case. there are no spatial symmetries besides trans- 
lations (the space group is p1). Inthe Supplementary Information, we 
show that three distinct elastic moduli exist for 0 4 6,, whereas at the 
self-dual point 0= 6, the elastic tensor becomes isotropic and only one 
non-vanishing modulus remains. 
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Fig. 4 | Mechanical spintronics vianon-Abelian geometric phases. Pushing 
onawavepacket with an effective force fmakes it move in momentum space 
and changes its pseudo-spinstate|@). This is described by the holonomy Wofa 
non-Abelian Berry connection, which acts on|g)as a single-qubit gate. a, The 
vibrational states WW ini) (WW0;.3)) obtained after the sequence of forces 
(fi. fr) (of (fa, f)) were applied for a duration rsuch that fr= a;* are represented 
by ellipses describing the motion of the masses, with the colour representing 
their phases. They are equivalently represented ona Bloch sphere 

(see Supplementary Information for a definition of the momentum-dependent 
basis). b, Sketch of the real space and momentum space trajectories 

(see Supplementary Information for the numerically integrated solutions). 
c,d, Toavoid non-adiabatic transitions, an asymmetry dmis introduced 
between the three masses ina unit cell (m,,m,,m,;) =(1-6m,1,1+6m), 


We now show how non-Abelian sound waves arise in our self-dual 
mechanical structures. Non-commuting (or equivalently non-Abelian) 
behaviour is pervasive in mechanics, from the moves of a Rubik’s 
cube to the non-holonomic dynamics of rolling spheres and robotic 
arms. Here, we focus instead ona more subtle phenomenon: the non- 
commutative behaviour of the classical excitations (that is, sound 
waves) that propagate on top of a background configuration. Note 
that because our reasoning does not rely ona particular length scale, 
it should also apply to nanostructures with quantum phonons. The 
propagation of a wavepacket constructed out of vibrational modes 
can be affected by geometric (or Berry) phases. For a single isolated 
band, the Berry phases are complex numbers of modulus one that 
manifestly commute. To obtain non-Abelian Berry phases, a set 
of (at least) two degenerate bands is required. The geometric 
phases then become 2 x 2 unitary matrices that do not need to 
commute®”, 


illustrated by circles of different sizes. As a consequence, the double Dirac cone 
becomes massive, with a gap (in grey) proportional to 6mat first order, while 
the global twofold degeneracy is preserved. e, An effective forcef,,, acting on 
the wavepackets is produced by applying a spatially varying harmonic 
potential that essentially shifts the optical bands in frequency proportionally 
to the potential (see Supplementary Information). f, Comparing W,W,W]!W;! to 
theidentity matrix provides a quantitative measure of the non-commutativity. 
The absolute value of the trace of their difference is plotted as a function of the 
starting point ky of the protocol. Inthe numerical simulations, we have 
considered a wavepacket projected on the bands with dispersions w;(k) =@,(k), 
and we have set ky = (2, 1) and 6m=0.1 (see Supplementary Information for 
details on the numerical computation and orders of magnitude). 


The self-dual kagome lattice is a suitable platform to realize non- 
Abelian sound because it has a global twofold-degenerate phonon 
spectrum, allowing us to realize a pseudo-spin mechanical degree of 
freedom |), represented by a Bloch sphere in Fig. 4a, b. To do so, we 
need to isolate a single twofold-degenerate band in the spectrum. This 
is done by assigning different values to the three masses in the unit cell 
to lift the threefold rotation symmetry of the lattice, creating a gap in 
the double Dirac cone at I (see Fig. 4c, d). This modification preserves 
the self-dual symmetry of equation (2), so the global twofold degen- 
eracy persists regardless of the values of the masses. Consider an acous- 
tic wavepacket constructed from the central set of twofold-degenerate 
bands with dispersions w,(k) = w,(k), where dispersion relations @(k) 
with i=1,...,6 are labelled with increasing frequencies (see Fig. 4d). As 
we apply external forces to the wavepacket as in Fig. 4b, it evolves within 
the twofold-degenerate subset of mechanical vibrations, as long as the 
external perturbation is small enough (see Supplementary Information 
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for orders of magnitudes). The wavepacket can then be treated as a 
particle-like object described by semiclassical equations of motion 
(see Supplementary Information and refs. 7°? *) that govern the evo- 
lution of its semiclassical position r(t), momentum k(t) and pseudo-spin 
(that is, polarization) |p(0). 

When aconstant force is applied to the wavepacket, its momentum 
k() increases linearly in time while its pseudo-spin state changes from 
an initial value|g,_,) toW[Cllg,,,,) (this situation is analogous to Bloch 
oscillations in solid-state physics). HereW[C] =P exp(-J. A) isa Wil- 
son line operator, the non-Abelian analogue of a Berry phase, that acts 
on|g) as a single-qubit gate, and A is the non-Abelian Berry connec- 
tion’*”” describing the twofold-degenerate band (see Supplementary 
Information). An effective force acting on wavepackets is not equivalent 
toa force acting onthe elements of the mechanical lattice. The effective 
force on the wavepackets can be obtained through an additional har- 
monic potential imposed to each mass with a spatially dependent stiff- 
ness, as illustrated in Fig. 4e. This effective force is applied for a duration 
tchosen so that the momentum changes by exactly one reciprocal 
lattice vector a,*. As a consequence, C,(A) = ky + Aa; is a closed loop 
starting and ending at k, (see Fig. 4b), and the Wilson loop W; = W[C,] 
is the holonomy of the Berry connection along C;,. 

Pictorially, pushing on the wavepacket changes its pseudo-spin state: 
this change is the holonomy W[C]. Here, the point is that the holono- 
mies do not commute: when one pushes on the wavepacket in different 
directions, the order of the pushes matters because the pseudo-spin 
state keeps track of what happened. This is represented in Fig. 4b. After 
the forces (f,, f,) are sequentially applied during the appropriate dura- 
tion, the pseudo-spin of a wavepacket initially at k, changes from any 
initial state|g,, ,) toW,W|9,,;). The reversed sequence (f,, f,) produces 
a different final pseudo-spin state W{W,|q,,;), because the two Wilson 
loops typically do not commute: 


WW, # WW, (3) 


These non-commuting mechanical responses share similarities with 
non-Abelian excitations such as anyons® °°, However, in the present 
study, non-commutativity arises from how independent wavepackets 
respond to external forces, whereas for anyons it is associated with 
the exchange (braiding) of these quasi-particles with each other. In 
Fig. 4f, we assess howthe choice of the initial point k, affects the non- 
commutativity of W,(k,) and W,(k,) by quantifying the deviation of 
W,W,W;'W;' from the identity matrix. 

Our results raise the prospect of materials in which information is 
encoded and processed using non-Abelian mechanical excitations and 
can be seenas a first step towards an extension of phononics” includ- 
ing mechanical pseudo-spins, which we call mechanical spintronics. 
More broadly, our work illustrates the power of duality relations in 
mechanics and wave physics. The counterintuitive degeneracies of 
elastic moduli and phonon spectra at the self-dual point suggest that 
dualities and their breaking may play as crucial a role in the design of 
metamaterials as symmetries currently do. 
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Methods 


Non-Abelian holonomies 

Geometric phases!**° ” describe the residual influence of its environ- 
ment onasubsystem considered in isolation. They typically arise when 
the system undergoes a slow cyclic change of its parameters. In this 
situation, the state of the system is transported over the parameter 
space to describe the evolution. The change in this state from before to 
after one cycleis a geometric phase factor. Formally, it is the holonomy 
of aconnection along the closed loop travelled in parameter space: 
the connection provides a covariant derivative describing the paral- 
lel transport in the vector bundle of system states. The holonomies 
(geometric phase factors) are not necessarily mere U(1) phases, but 
canbe matrices referred to as non-Abelian geometric phases”, because 
the formalism allows for non-commuting holonomies (phase factors). 
More precisely, let 


W(C)=P exp{- j. A) (4) 


be the holonomy of the connection along the curve C in parameter 
space. Here, A is the connection form (see Supplementary Information 
for details), called a non-Abelian gauge field in the context of gauge 
theory*. The quantity W (C)is also called a Wilson loop operator, and 
can be computed numerically (see Supplementary Information). We 
are interested in situations where two holonomies do not commute, 
that is, whenW(C,)W(C,) # W(C,)W(C,)for two closed loops C; starting 
(and ending) at the same point. 

In the main text, we consider the propagation of wavepackets ina 
lattice of coupled mechanical oscillators. In this situation, the restric- 
tiontoa subsystem described in isolation consists in assuming that the 
wavepacket stays in a given (set of degenerate) phonon bands, ignor- 
ing the bands with higher or lower frequencies. The parameter space 
is momentum space (that is, the Brillouin torus): the quasi-classical 
momentum changes because a force is applied to the wavepacket, and 
effectively acts as an external parameter from the point of view of the 
wavepacket (see Supplementary Information and refs. 7°*? 3445°>°), We 
emphasize that although the wavepacket moves both in momentum 
space and in position space, the non-Abelian geometric phase factors 
discussed in the main text are related to the trajectory in momentum 
space (not position space). 

Non-Abelian holonomies arise in various physical contexts. The most 
common situation corresponds to an abstract parameter space” ™*. In 
the situation analysed in the main text, the relevant parameter space is 
momentum space. This is similar to situations arising in the study of arti- 
ficial spin-orbit coupling and non-Abelian gauge fields in optics** °° and 
in ultracold atomic gases” *. Non-Abelian holonomies can also arise 
with position space as the relevant parameter space, suchas inthe non- 
Abelian Aharonov-Bohm effect* recently predicted and reported in 
optics”. Again, similar phenomenaarise in ultracold atomic gases” ©. 
In another context, (non-Abelian) anyons®* © can also be understood 
as (non-Abelian) holonomies®”; where the relevant parameter space 
is essentially describing the positions of the excitations. Interestingly, 
similar situations where classical excitations can be braided around 
one another were reported in optics and mechanics** **”, 

A fascinating facet of non-Abelian holonomies is the possibility of 
harnessing them to realize holonomic computations”. This idea was 
developed with quantum computation in mind, but it can indeed also 
be applied to classical systems (and hence, classical computation). 
From this point of view, the non-Abelian holonomies described in the 
main text can be seenas single-qubit gates. This is not enough to realize 
any quantum computation, even with quantum degrees of freedom 
(phonons). For that, one would need two-qubit gates, which would 
require an interaction between the wavepackets ignored in our analysis. 
Nonetheless, a suitable design of the pattern generating the effective 


forces should allow the realization of simple classical operations on 
the mechanical pseudo-spins carried by the wavepackets that would 
enrich the growing toolbox of phononic systems*””’. 


Semiclassical equations and Bloch oscillations 

In the main text, we consider a wavepacket centred at a position rand 
amomentum k. We assume that this wavepacket is composed of anar- 
row band of frequencies corresponding to the two degenerate bands 
with dispersions w,(k) = @,(k) (the same analysis could be done onthe 
upper bands w,(k) = @,(k)). Its composition in the corresponding two- 
dimensional space is summarized by a pseudo-spin (polarization) |g). 
Inthe context of wave physics, semiclassical approximations provide 
an approximate particle-like description of a wavepacket localized in 
both physical space and momentum space. For instance, geometrical 
optics can be viewed asa short-wavelength approximation of Maxwell 
equations”. Here, the position rand momentum kand the pseudo-spin 
|y)are promoted to the status of dynamical variables (called semiclas- 
sical variables) that evolve when the system is perturbed, for example 
when the wavepacket propagates ina non-uniform medium. The equa- 
tions describing the semiclassical dynamics of the wavepacket ina 
(perturbed) spatially periodic structure can be systematically obtained 
from the underlying wave equations”°*?3>-”? (see also refs. *+4>°° for 
application to classical waves). They read 


. On ; 
fH “8k, + iF’), k, 

. OV (5) 
ky aa ort 


@ =-(iO(k) +iV(r) + A“k,p 


where Q(k) = @,(k) = @,(k) is the dispersion relation of the relevant 
degenerate bands, V(r) is an external potential, and A(k) and F(k) are 
the matrix-valued non-Abelian Berry connection and curvature forms 
of the degenerate band'*** (see Supplementary Information for more 
details). 

The case of a constant (uniform) force f, = -0,V considered in the 
main text describes so-called Bloch oscillations®®, a situation realized 
in semiconductor superlattices® and in optical lattices of cold atoms™ 
that provides a powerful tomographic tool for Bloch states® **. In this 
case, the momentum equation k = f,can beintegrated to k(t) =ko + fot. 
The pseudo-spin can then formally be obtained up toa phase as a Wil- 
son loop along the momentum space trajectory (see Supplementary 
Information for more details). 


Fabrication of the kagome lattices in LEGO bricks 

The LEGO bricks realization of the kagome lattice allows us to demon- 
strate its collapse mechanism. It is composed of LEGO Technic liftarms 
connected by pins (see Supplementary Fig. 15). 

Each pincan be attached to at most four liftarms, at different heights 
h,,...,h,; and inthe kagome lattice, it must be attached to exactly four 
liftarms. As the liftarms are rigid, they should not be bent, so the two 
pins connected by a liftarm should be attached at the same height. 
Amusingly, this constraint is similar to the ice rule of the six-vertex 
model. A practical consequence is that the unit cell of the kagome 
lattice has to be enlarged in the LEGO bricks realization. In our design, 
the unit cell is doubled with respect to the original lattice in order for 
the vertices to satisfy the ice-like rule. It is composed of 12 ‘liftarms 
1x 6 thin’ (LEGO part 32063) and six ‘pins without friction ridges 
lengthwise’ (3673). 

The presence of edges induces additional (unwanted) zero-energy 
mechanisms besides the global Guest-Hutchinson mechanism. They 
are suppressed as muchas possible with an additional structure that 
prevents local motions without preventing the global deformation 
of the system. 
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Data availability 
No external data set was used during the current study. 


Code availability 


The code used to compute the band structures and the holonomies, to 
perform the group-theoretical analysis, to integrate the semiclassical 
equations of motion and to verify the duality relations is available on 
Zenodoat https://doi.org/10.5281/zenodo.3417426 under the 2-clause 
BSD licence. 
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Memristor-enabled neuromorphic computing systems provide a fast and energy- 


efficient approach to training neural networks’ *. However, convolutional neural 
networks (CNNs)—one of the most important models for image recognition*—have 
not yet been fully hardware-implemented using memristor crossbars, which are 
cross-point arrays with a memristor device at each intersection. Moreover, achieving 
software-comparable results is highly challenging owing to the poor yield, large 
variation and other non-ideal characteristics of devices® °. Here we report the 
fabrication of high-yield, high-performance and uniform memristor crossbar arrays 
for the implementation of CNNs, which integrate eight 2,048-cell memristor arrays to 
improve parallel-computing efficiency. In addition, we propose an effective hybrid- 
training method to adapt to device imperfections and improve the overall system 
performance. We built a five-layer memristor-based CNN to perform MNIST” image 
recognition, and achieved a high accuracy of more than 96 per cent. In addition to 
parallel convolutions using different kernels with shared inputs, replication of 
multiple identical kernels in memristor arrays was demonstrated for processing 
different inputs in parallel. The memristor-based CNN neuromorphic system has 

an energy efficiency more than two orders of magnitude greater than that of 
state-of-the-art graphics-processing units, and is shown to be scalable to larger 
networks, such as residual neural networks. Our results are expected to enablea 
viable memristor-based non-von Neumann hardware solution for deep neural 
networks and edge computing. 


CNNs have become one of the most important deep neural networks 
(DNNs)° and play a vital role in image-processing-related tasks, such 
as image recognition”, image segmentation and object detection”. 
Atypical computing procedure for a CNN involves a large number of 
sliding convolutional operations. In this respect, computing units that 
support parallel multiply-accumulate (MAC) calculations are highly 
desired. Such demand has led to the redesign of conventional comput- 
ing systems to operate CNNs with higher performance and lower power 
consumption, ranging from general application platforms, such as 
graphics-processing units (GPUs)®, to application-specific accelera- 
tors'*’, However, further improvements in computing efficiency will 
ultimately be constrained by the von Neumann architecture of these 
systems, in which the physical separation of memory and processing 
units results in substantial energy consumption and large latency in data 
shuffling between units". By contrast, memristor-enabled neuromor- 
phic computing provides a promising non-von Neumann computing 
paradigm in which the data are stored, thus eliminating the cost of data 
transfer’. By directly using Ohm’s law for multiplication and Kirchhoff’s 
law for accumulation, a memristor array is capable of implementing 
parallel in-memory MAC operations, leading to analogue in-memory 
computing with greatly improved speed and energy efficiency’. 


Studies on memristor-based neuromorphic computing have covered 
abroad range of topics, from device optimization to system implemen- 
tation®” ”. Several experimental demonstrations*™ * related to practi- 
cal applications of in-memory computing have been reported as well. 
The most recent studies report the demonstrations of two-layer* and 
three-layer?” memristor multi-layer perceptronsfor image recognition 
using the MNIST (Modified National Institute of Standards and Technol- 
ogy) handwritten-digit database’’. However, a complete CNN, whichis 
essential for more complex image-recognition tasks, has not yet been 
demonstrated ina fully memristor-based hardware system. The reason 
mainly pertains to the lack of an efficient solution for the implementa- 
tion”’ of amemristor-based CNN (mCNN): first, the fabricated MCNN 
usually suffers from a poor yield and non-uniformity of memristor 
crossbar arrays*’®. Second, it is difficult to achieve a performance (for 
example, image-recognition accuracy) comparable to software results 
owing to device imperfections, such as variations, conductance drift 
and device state locking®®. Third, the key convolutional operation in 
CNNs is time-consuming because of the need to slide over different 
input patches, whichis usually a sequential process and results in speed 
mismatch between the memristor convolver and the memristor array 
for fully connected vector—matrix multiplication (VMM). 


‘Institute of Microelectronics, Beijing Innovation Center for Future Chips (ICFC), Tsinghua University, Beijing, China. *Beijing National Research Center for Information Science and Technology 
(BNRist), Tsinghua University, Beijing, China. Department of Electrical and Computer Engineering, University of Massachusetts, Amherst, MA, USA. *e-mail: wuhq@tsinghua.edu.cn 
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Shared blocks 


Fig. 1| Memristor-based hardware system with reliable multi-level 
conductance states. a, Schematic of the system architecture with eight 
integrated memristor PE units and other functional blocks. DRAM, dynamic 
random-access memory; ARM core, control unit with ARM (Acorn RISC 
Machine) architecture. b, Left, photograph of the integrated PCB subsystem, 
also knownas the PE board. Right, image of a partial PE chip consisting ofa 
2,048-memristor array and on-chip decoder circuits. c, Cumulative probability 


In this study, a complete five-layer MCNN for MNIST digit image 
recognition was successfully demonstrated. The optimized material 
stacks enabled reliable and uniform analogue switching behaviours 
in 2,048 one-transistor-one-memristor (1T1R) arrays. With the pro- 
posed hybrid-training scheme, the experimental recognition accuracy 
reached 96.19% for the entire test dataset. Furthermore, replication of 
the convolutional kernels to three parallel memristor convolvers was 
implemented to reduce the mCNN latency roughly by a factor of 3. Our 
highly integrated neuromorphic system provides a feasible solution 
to substantially improve the CNN efficiency by closing the throughput 
gap between memristor-based convolutional computation and fully 
connected VMM. 

Realizing a practical memristor-based neuromorphic computing 
system usually requires the integration of multiple memristor crossbar 
arrays. In general, splitting the weights into different arrays is beneficial 
for parallel computing, which is increasingly needed with increasing 
network scales. However, previous memristor-based demonstrations 
relied onasingle array*”*”*, mainly because of the challenge of produc- 
ing highly repeatable arrays. The variability and non-ideal character- 
istics of memristive devices are considered as substantial hurdles to 
the application of neuromorphic computing’ ’. 

Here we propose a versatile memristor-based computing architec- 
ture for neural networks, shown in Fig. 1a. The memristor cell uses 
a material stack of TiN/TaO,/HfO,/TiN, and shows continuous con- 
ductance-tuning capability (see Supplementary Information) in both 
potentiation (SET) and depression (RESET) by modulating the electric 
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distribution of 1,024 cells with respect to 32 independent conductance states. 
The conductance is equivalently represented by the read-out current undera 
0.2-V voltage pulse. For programming, the SET conditions were Vy, =1.8 V for 
the word-line voltage, V,, = 2.0 V (50 ns pulse) for the bit-line voltage and 

V., = 0 V for the source-line voltage; the RESET conditions were Vy, =4.7 V, 
Vg, =0 Vand V,, =1.8 V (SOns pulse). 


field and heat”’. The materials and fabrication process (see Methods for 
details) are compatible with the conventional CMOS (complementary 
metal-oxide semiconductor) process, so that the memristor arrays can 
be conveniently built in the back end of line ina silicon fab to reduce 
process variations and achieve high reproducibility. The fabricated 
crossbar arrays exhibit uniform analogue switching behaviours under 
identical programming conditions. Hence, a multiple-memristor-array 
hardware system (see Supplementary Information) was built using a 
customized printed circuit board (PCB) and a field-programmable 
gate array evaluation board (ZC706, Xilinx). As the system sche- 
matic shows, the system mainly consists of eight memristor-based 
processing elements (PEs). Each PE has its own integrated 2,048-cell 
memristor array. Each memristor is connected to the drain terminal 
of a transistor, namely, ina 1T1R configuration (see Supplementary 
Information). The core PCB subsystem with eight memristor array chips 
is presented in Fig. 1b. Each memristor array (right inset of Fig. 1b) has 
anassembly of 128 x 161T1R cells. There are 128 parallel word lines and 
128 source lines horizontally, and 16 bit lines vertically (see Methods for 
details). This array exhibits remarkably repeatable multi-level conduct- 
ance states, as shown by the test results in Fig. lc and the measured data 
from the remaining 2,048-cell arrays in Extended Data Fig. 1. Figure 1c 
shows the distribution of 1,024 memristors in 32 different conductance 
states, where all the curves are separated without any overlap. Identical 
SET and RESET pulse trains with a pulse width of SO ns were employed in 
the closed-loop programming” operations to reacha certain conduct- 
ance state. The measurement flow is described in Methods. 
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Fig. 2| Five-layer mCNN with memristor convolver. a, Structure of the five- 
layer mCNNused for MNIST image recognition, with alternating convolutional 
(C1, C3) and subsampling (S2, S4) layers. The input is a 28 x 28 greyscale (8-bit) 
digit image. The mapping relations between the weights of different layers and 
the independent PEs are illustrated (see Methods for details). The top labels 
give the kernel size (input channel depth x filter height x filter width ~ filter 
batch size) for Cland C3, the pooling size for S2 and S4 and the weight size for 
the FC layer (input neuron number x output neuron number). The bottom 
labels provide the feature map dimension (height x width x channel depth) or 
the vector dimension. b, Typical convolutional case during the slipping 


As shown in Fig. 2a, a five-layer CNN was constructed ona memristor- 
based hardware system to recognize MNIST handwritten-digit images. 
The detailed data flow in the CNN and the corresponding memristor 
mapping are described in Methods. 

Realizing memristor-based convolutional operations requires per- 
forming sliding operations with various kernels. Memristor arrays are 
highly efficient in achieving parallel MACs under shared inputs for 
different kernels”. Figure 2b shows a typical convolution example ata 
particular slipping step, and Fig. 2c reveals the associated events inthe 
1TIR memristor array. The input value is encoded by the pulse number 
according to its quantized bit number (Extended Data Fig. 2). Asigned 
kernel weight is mapped to the differential conductance of a pair of 
memristors. In this manner, all the weights of a kernel are mapped to 
two conductance rows: one row for positive weights with positive pulse 
inputs and the other for negative weights with equivalent negative 
pulse inputs. After inputting the encoded pulses into the bit lines, the 
output currents through the two differential source lines are sensed 


BL8 BL9 


process. The grey box in the image confines the input patch of this sample case. 
Xmnindicates the relevant pixel at the crossing of rowmandcolumnn. Kernels i 
and/each have a total of 3 x 3 weights. c, The equivalent memristor convolver of 
the convolutional operation in b. Throughout the entire parallel computing 
process, all word lines (WL) are set to Vw, =4.5 V. The injected bit line (BL) pulses 
are 0.2 V, and the source lines (SL) are clamped at O V. wrepresents the element 
value in the weight matrix, andg* andg’ represent the device conductance 
values for the positive and negative weights in the differential pair, 
respectively. 


and accumulated. The differential current is the weighted sum corre- 
sponding to the input patch and the chosen kernel. Different kernels 
with different weights are mapped to different pairs of differential 
rows, and the entire memristor array operates MACs in parallel under 
the same inputs. All the desired weighted-sum results are obtained 
concurrently. 

Intypical CNNtraining, itis necessary to propagate the objective deriva- 
tive backwards with respect to the last outputs, to determine all weight 
updates”. This task requires highly complex operations to apply encoded 
read pulses to source lines from back to front and layer by layer. Further- 
more, it is challenging to train a complicated memristor DNN, owing to 
non-ideal device characteristics, such as nonlinearity and asymmetric 
conductance tuning®””. In contrast to the pure in situ training solution, 
the ex situ training method appears to bea shortcut that takes advantage 
of existing high-performing parameters. However, inevitable hardware 
imperfections, suchas defective devices and parasitic wire resistance and 
capacitance, would blur the weights and degrade the system performance 
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Fig. 3 | Hybrid training onthe mCNN. a, Flowchart of the hybrid-training 
method used in this experimental demonstration. b, Diagram of the 
experimental mCNN demonstration with hybrid training. First, the system 
transfers the kernel weights of different convolutional layers and the 192 x10 
FC weights to the memristor PEs. Next, the system maintains the kernel weights 
unchanged and updates only the FC weights through in situ training. c-e, 


whentransferring the ex situ learned weights to memristor conductances’. 
Therefore, ex situ training normally requires prior knowledge of the hard- 
ware situation and learns weights on the basis of this costly awareness using 
software. 

To circumvent various non-ideal device characteristics, a hybrid- 
training method is proposed to implement the mCNN. The entire flow- 
chart, illustrated in Fig. 3a, includes two stages. First a CNN model is 
trained ex situ, and then all the determined weights are transferred to 
the memristor PEs by a closed-loop writing method. In the next step, 
the external input propagates forwards through the mCNN, and only 
the last fully connected (FC) layer is trained in situ afterwards to tune 
the memristor conductance. It should be pointed out that the proposed 
hybrid learning method is different from typical transfer learning”””°. 
Hybrid training aims to accommodate the device variations in the pre- 
vious layers to implement the parallel mCNN efficiently through the 
in situ training of the memristor weights, whereas transfer learning 
typically retrains the weights of the FC layers (hereafter, FC weights) 
using software to obtain knowledge ona new dataset. 

Here the hybrid-training technique emphasizes the training of FC 
weights to compensate for existing device imperfections, and it could 
be extended as a generic system approach to address device variations 
and other non-ideal device characteristics by in situ tuning of some 
memristor weights. Hybrid training is applicable to a wide range of 
neural network models and could be used to address non-ideal device 
characteristics regardless of the specific type of memristor device. 
However, it is worth mentioning that, compared with traditional ex situ 
training, hybrid training requires fetching the training data to realize 
in situ conductance tuning; therefore, additional memory blocks or 
data-transmission modules might be required. 

Amemristor model is established to validate that the in situ training 
of only the FC layer is generally adequate for compensating for device 


644 | Nature | Vol577 | 30 January 2020 


Weight transfer 


=p 


Unchanged 


Weight transfer 


f 2 z* 
15 —e— Testset —e— Training set 

Initial error rate on test set, 4.93% 

gS (after weight transfer) 

107 Final error rate on test set, 3.81% 

® (after hybrid training) 

re} 

i Error rate decreases by 1.12% 

Bil iN RO ie at Sania Se ge, oo 


200 300 400 500 600 


Iteration index 


0 100 
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imperfections and that it yields remarkable generalization results 
(see Methods for details). Inthis manner, hybrid training uses the advan- 
tages of both ex situ and in situ training, which complement each other. 

To realize an mCNN with hybrid training (Fig. 3b), a model (Fig. 2a) 
was trained ex situ in Python with TensorFlow ona training set contain- 
ing 55,000 images. The recognition accuracy for the 10,000 test digit 
images was 97.99%, which was taken as the baseline accuracy. The well 
trained weights were rescaled to meet the unified memristor conduct- 
ance window and quantized from 32-bit floating type to 15-level fixed- 
point type (see Methods for details). Reasonable weight quantization 
caused a tolerable performance degradation; for example, the 4-bit 
quantization of kernel weights and 2-bit quantization of the FC weights 
for atypical CNN model, AlexNet, was shown to induce a 2.60% increase 
in the recognition error for ImageNet (a widely used image database 
for visual object recogoniction) classification compared with the 32-bit 
quantization of kernel weights and FC weights*. The quantization of 
the 15-level fixed point relaxed the conductance mapping requirements 
to speed up weight transfer, and ensured a high recognition accuracy 
of 96.92%, close to the software baseline. 

Subsequently, the quantized kernel weights of the convolutional lay- 
ers and the weights of the FC layer were transferred to the correspond- 
ing memristor conductance (Fig. 2a). The weight-transfer accuracy 
distributions of the convolutional layers C1 and C3 and the FC layer 
are shown in Fig. 3c-e. The error distributions probably arise from 
device variations, conductance drift and state locking. The memris- 
tor hardware system still achieves a recognition accuracy of 95.07% 
(see Methods for details) on the 10,000 test images—a 2.92% accuracy 
loss compared with the baseline value of 97.99%. After an epoch of 550 
training iterations (a mini-batch of 100 training images was fed into the 
mCNN for oneiteration) onthe entire training database, the recognition 
error rate for the 10,000 test images decreased considerably fromthe 
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Fig. 4 | Parallel memristor convolvers with hybrid training for improving 
convolutional efficiency. a, Sketch of the hardware system operation flow 
with hybrid training used to accommodate non-ideal device characteristics 
for parallel memristor convolvers. Three batches of input images (handwritten 
digits onthe left) are fed into three PE convolver groups. All the processed 
intermediate data are fed into the shared FC PEs tocomplete the in situ tuning. 
In the neural network schematic, the blue part represents convolutional layer 
Cland subsampling layer S2, and the green part represents convolutional layer 
C3 and subsampling layer S4. In the PE diagram, the blue region represents the 
kernels of the Cl layer and the green region represents the kernels of C3 layer. 


initial value of 4.93% to the final value of 3.81% (Fig. 3f). The error rate 
onthe training set also dropped from 4.82% to 3.21%. 

Inamemristor-based neuromorphic computing system, the accuracy 
loss is mainly attributed to two factors: first, the presence of non-ideal 
device characteristics, suchas device variations, array yield problems 
and device reliability issues; second, the limited precision due to weight 
quantization. Eventhough the accuracy is not fully recovered given the 
limited quantization precision, the experimental results suggest that 
the hybrid-training method could effectively recover high recogni- 
tion accuracy by accommodating device variations across different 
memristor crossbars. It should be emphasized that in this in situ train- 
ing process, only the FC weights are updated in an analogue fashion, 
instead of retraining all the conductance weights. The detailed training 
procedure is described in Methods. 

Further experiments were conducted to show the effect of conduct- 
ance drift on system performance (see Methods and Extended Data 
Fig. 3). According to the test results, the reliability of multiple conduct- 
ance states needs to be further investigated and improved by material 
and device engineering, which remains an active research area. 

Althougha memristor convolver can realize the different kernels of 
a convolutional layer in parallel under shared inputs (Fig. 2b), operat- 
ing an MCNN remains time-consuming owing to the need to provide 
different patches of input during the sliding process. Considering 
that memristor-based MAC computing is more efficient and straight- 
forward when used as the VMM of the FC layer, the severe speed mis- 
match between the memristor FC implementation and the memristor 
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b-d, Weight-transfer error distributions for the mapping from the ex situ- 
trained kernel weights of the Cland C3 layers onto three different groups: G1 
(b), G2 (c) and G3 (d). The size of the colour map is 104 x 9. The colour bar 
represents the error in the transferred current values at a0.2-V read pulse. 

e, Evolution of experimental FC weight distributions (120 x 16) before (top) and 
after (bottom) hybrid training. f, Distribution of conductance-weight changes 
corresponding toe. g, The error rate obtained onthe test set after hybrid 
training is considerably lower than that measured immediately after weight 
transfer for eachconvolver group. 


convolver” would induce sizeable efficiency loss. Replicating the same 
group of weights in multiple parallel memristor arrays appears to bea 
promising way to recognize an input image efficiently inan mCNN. Spa- 
tial parallelism of the memristor convolvers could expedite convolu- 
tional sliding tremendously. In practice, transferring the same weights 
to multiple parallel memristor convolvers is challenging because of 
unpredictable and inevitable device variations, conductance drift and 
state locking® °, which would induce unavoidable and random mapping 
error distributions. This process could result in substantial system 
generalization loss and is therefore considered as a major bottleneck 
for the realization of an efficient mCNN™. 

A five-layer CNN with three duplicated parallel convolvers on the 
eight memristor PEs was successfully established in our full hardware 
system. Hybrid training was again used to address the non-ideal device 
characteristics. The approach used to perform hybrid training in the 
parallel operating scheme is sketched in Fig. 4a. In the beginning, the 
ex situ trained weights were transferred to all eight memristor PEs. 
Specifically, the kernel weights of the Cl and C3 layers were mapped to 
three independent groups of PEs. All three parallel memristor convolv- 
ers were connected to common PEs of shared FC weights. The specific 
mapping details are shown in Fig. 4a. Figure 4b, c, dshows the accuracy 
distributions of the total kernel weights (both C1 and C3 layers) after 
the weight transfer with respect to the first, second and third groups of 
PEs. It is clear that inevitable mapping errors exist in each group. The 
subsequent in situ training of the FC weights (see Methods for details) 
compensates for the device imperfections naturally. We ran 100 rounds 
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(that is, 300 batches) to reacha stable recognition accuracy. Figure 4e 
illustrates the transition of the FC conductance weights before and after 
the in situ training, and Fig. 4f presents the related distribution of the 
change in FC weights. After the in situ training of the FC memristors, 
the error rate decreased accordingly. Figure 4g shows that the error 
rates with respect to the memristor PE groups G1, G2 and G3 decreased 
from 4.79%, 6.60% and 6.20% to 3.41%, 4.86% and 3.86%, respectively 
(see Extended Data Fig. 4 for results on the training set). By dividing 
one input into three fraction regions uniformly from top to bottom, 
the parallel memristor convolvers could accelerate the forward process 
ona single image. The three convolvers operated on their associated 
input parts simultaneously, and their outputs were fed together into 
the FC layer to complete the classification. The experimental results 
show that hybrid training could boost the recognition accuracy on 
the 10,000 test images from 93.86% to 95.83%. Moreover, we carefully 
evaluated the hardware performance of memristor-based neuromor- 
phic computing using the experimental data (see Methods, Extended 
Data Fig. 5 and Extended Data Tables 1, 2). The performance benchmark 
of the memristor-based neuromorphic computing system shows 110 
times better energy efficiency (11,014 GOP s!W‘; 1 GOP = 10” opera- 
tions) and 30 times better performance density (1,164 GOP s' mm”) 
compared with Tesla V100 GPU”. It should be mentioned that some 
necessary functional blocks (such as the pooling function, the activa- 
tion function, and the routeing and buffering of data between different 
neural-network layers) were not considered in the comparison. These 
blocks could be integrated monolithically with the memristor arrays 
in the future and accounted for in the energy efficiency calculation. 

These findings suggest that the parallel memristor convolvers 
are highly efficient in achieving a high recognition accuracy while 
greatly accelerating the mCNN. In addition, the method of replicating 
the same kernels to different memristor convolvers could be scalable 
to larger CNN models to boost the parallel computing efficiency. The 
associated expenditure of chip area could be minimized in the future 
by employing high-density integration of memristors””. A standard 
residual neural network, ResNET-56", with acompact memristor model 
was explored on the CIFAR-10 database and exhibited only a slight 
accuracy drop of 1.49% compared with the software baseline of 95.57% 
(see Methods and Extended Data Fig. 6). 

Here, we proposed a hybrid training method to maintain high training 
efficiency and accuracy ina multiple-crossbar memristor CNN system. 
We should mention that although a small subset of the training data 
is sufficient in hybrid training, additional memory or data-transfer 
modules might be required. Moreover, a higher weight quantization 
precision is needed to fully recover the system accuracy, but at the cost 
of more hardware resources. Meanwhile, the system performance could 
be further enhanced by optimizing the peripheral circuits—especially 
the analogue-to-digital converter (ADC) blocks—and improving device 
reliability. 

In summary, we have experimentally demonstrated a complete 
mCNN with hybrid training and parallel computing on multiple mem- 
ristor arrays. The hybrid-training method is a generic system-level 
solution that accommodates non-ideal device characteristics across 
different memristor crossbars for various neural networks, regardless 
of the type of memristor device. The parallel convolution technique, 
which replicates weights to multiple memristor arrays, is proposed 
to eliminate the throughput gap between memristor-based convolu- 
tional computation and fully connected VMM. Generally, this technique 
could be extended to other memristor-based neuromorphic systems 
to efficiently boost their overall performance. The benchmark of our 
memristor-based neuromorphic computing system shows more than 
two orders of magnitude better power efficiency and one order of 
magnitude better performance density compared with Tesla V100 GPU. 
We expect that the proposed approach will enable the development of 
more powerful memristor-based neuromorphic systems. 
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Methods 


Fabrication of 1TIR memristor array 

The fabricated memristor array has a1T1R structure (see Supplemen- 
tary Information) in which the memristor stacks are TiN/TaO,/HfO,/ 
TIN. This array has a high operation speed of -10 ns, ahigh yield (99.99%) 
and robust endurance performance. 

Alltransistors and major metal interconnections and vias are fabri- 
cated ina standard CMOS foundry. The technology node is 130 nm. The 
back end of line—that is, the procedure used to complete the memris- 
tor stacks and the remaining top metal interconnections and vias—is 
processed in the laboratory. The bottom electrode layer of TiN, the 
switching layer of HfO,, the capping layer of TaO, and the top electrode 
layer of TiN are deposited sequentially after receiving the wafers from 
the foundry. The capping layer is used as a thermally enhanced layer** to 
modulate the distribution of the electric field and heat in the switching 
layer for improved device behaviour. Afterwards, alithographic process 
is adopted to form isolated 0.5 pm x 0.5 um memristor stacks. Then, 
the SiO, dielectric is added and polished. The final steps of etching the 
vias, depositing aluminium and shaping the remaining interconnection 
patterns are performed to complete the fabrication process. 


Structure of memristor array 

A PE chip (Fig. 1b) integrates on-chip encoder circuits and a128 x 16 
1T1R memristor array (see Supplementary Information). The memristor 
array is constructed by connecting the top electrodes of 128 memris- 
tor devices on the same column (that is, bit line) and the 16 transistor 
sources on the same row (that is, source line). The transistor gate ports 
facilitate fine memristor-conductance modulation by controlling the 
device’s compliance current with a specific applied gate voltage. The 
gates inaroware connected to the same line (thatis, word line), which 
is parallel to the source line. This memristor array acts as a pseudo- 
crossbar of two-port memristors by operating all transistors in the 
deep-triode region. 


Measurements of multi-level conductance states 

To measure the reliability of multi-level conductance (see Fig. 1c) in 
the array, we used a closed-loop writing method with identical SET and 
RESET pulses. During the test, we supplied the programming pulses 
to 1,024 randomly chosen memristors from the array to reach 32 indi- 
vidual conductance targets. These target states were distributed within 
the switching window from 2 pS (that is, 0.4 pA at 0.2-V read voltage) 
to 20 pS (that is, 4 pA at 0.2-V read voltage) with a uniform interval 
of 0.58 pS (that is, 116 nA at 0.2-V read voltage). For any desired con- 
ductance state, such as /, at a 0.2-V read voltage, we established the 
maximum programming pulse number to be 500. In addition, we set 
the defined target margin parameter A/to be +50 nA. When writing an 
individual cell to this conductance /, from any initial state, we continu- 
ously applied operating pulses up to the maximum pulse number, and 
the real-time conductance value was sensed aS /,.,q at a 0.2-V read volt- 
age after each programming pulse. If/,.,4 was within the desired range, 
from /,— A/to/,+ Al, the procedure ended successfully. Otherwise, a 
subsequent SET or RESET pulse was applied accordingly (see Supple- 
mentary Information). This entire process was conducted repeatedly 
over the chosen memristors for the 32 conductance targets. The low- 
conductance switching range and succinct operation with identical 
programming pulses could be used to simplify the system design and 
achieve low-power monolithic integration. 


Structure of the five-layer CNN 

As shown in Fig. 2a, a Cl layer measuring 26 x 26 x 8 
(weight x height x depth) is acquired after convolution with kernel 
weights measuring 1 x 3 x 3 x 8 (depth x weight x height x batch). The 
result is subsampled by a pooling layer (S2), that uses a3 x 3 max- 
pooling operation over the input with a sliding stride of 3. Then, a C3 


layer is formed with 12 stacked feature maps after convolution with 
the8 x3 x3 x12kernels. Another pooling layer (S4, 4 x 4 x 12) is subse- 
quently formed using a2 x 2 max-pooling operation witha stride of 2. 
Then, the expanding 192-element vector is passed into the FC layer to 
obtain the final 10 probability outputs, determining the class to which 
the input belongs. The inset (dashed box) clarifies how to map the total 
weights of different layers to memristor PEs of the hardware system. 
In the experimental demonstration, 9 of 16 memristors in a row were 
used to realize a3 x 3 kernel, and the residual cells remained unused. 
Hence, the 1 x 3 x 3 x 8 kernel weights of the C1 layer required 16 dif- 
ferential rows of memristors (PEI), and the 8 x3 x 3 x 12 kernel weights 
of the C3 layer required 192 differential rows of memristors (PE1 and 
PE3). Owing to the limited number of memristors per row (that is, 16), 
we split the total 192 weights connected to an output neuron in the 
FC layer into 24 differential rows and gathered all the corresponding 
currents of the 12 positive weight rows and 12 negative weight rows 
(see Supplementary Information). Thus, we were able to map the total 
FC weights to PES (120 rows) and PE7 (120 rows) to carry out the equiva- 
lent VMM of the FC layer. 


mCNN demonstration 

Atypical CNN modelis created by stacking convolutional and pooling 
layers repeatedly in series, followed by one or two FC layers at the end. 
Here we implemented a complete five-layer CNN with our memristor- 
based hardware system to recognize MNIST handwritten-digit images. 
The CNN model employed is shown in Fig. 2a. The model contains 
two convolutional layers, two pooling layers and one FC layer. The 
max-pooling and ReLU (rectified linear unit) activation functions are 
employed. The images in this dataset are categorized into 10 classes 
numbered 0 to 9. The input layer has 784 neurons, which is consist- 
ent with the number of pixels in the 28 x 28 input image. There are 
eight 3 x 3 kernel weights for the first convolutional layer (C1 layer in 
Fig. 2a) and twelve 3 x 3 x 8 kernel weights for the second convolutional 
layer (C3 layer in Fig. 2a). The convolutional operation is conducted 
by calculating the weight sums between the shared local kernel and 
the generated input patch of the input layer during continuous slid- 
ing witha fixed stride step. This operation could be decomposed into 
parallel MAC operations, which are naturally amenable to a memristor- 
based in-memory-computing architecture. The input patch is unrolled 
into anine-dimensional vector. The hardware system then drives nine 
channels of pulses accordingly to be supplied to nine bit lines simul- 
taneously. A weight is represented by two differential 1T1R memristor 
conductances, and thereby a kernel is mapped throughout to the cor- 
responding positive and negative weight rows. The difference in the 
cumulative flowing currents through these two related source lines is 
precisely the desired weighted sum of the kernel weights and the input 
patch. The elements of the second pooling layer (S4 layer in Fig. 2a) are 
flattened and expended as a192-dimensional vector to be passed into 
the last FC layer, and then the weighted-sum values are fed as the input 
of the softmax function to compute the classification probability. In 
this manner, the system leads to a map from the original digit image 
to the ten output probabilities of the last layer. Each output neuron 
is associated with a defined digital class. The largest among the out- 
puts indicates that the CNN classifies the input image to the matching 
category accordingly. The associated pooling and ReLU activation 
functions, as well as the update-calculating modules (such as those 
computing softmax outputs and weight gradients), were realized by 
running the codes on ARM cores. 


Hybrid training ona subset of the training images 

We trained the five-layer CNN model in Python and reached 97.99% 
recognition accuracy on the test set. The extracted memristor compact 
model was then used to validate that in situ learning of the FC conduct- 
ance weights is generally adequate for tolerating device imperfections. 
After transferring the weights, the recognition accuracy dropped from 
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97.99% to 95.63% owing to the non-ideal memristor characteristics. 
Afterwards, all possible combinations of different layers of the weights 
were tuned—that is, we tried to train the FC weights only, the weights 
of Cl only, the weights of C3 only, the weights of the FC layer and the 
C3 layer together, etc. Five epochs of measurements were conducted 
onthe entire training set for the arch trial. As shown in Extended Data 
Fig. 4b, tuning the FC conductance weights only is most efficient for 
regaining aremarkable generalization result. Essentially, this approach 
guarantees a high recognition accuracy and simplifies the original end- 
to-end training flow by skipping the backward propagation. 

Furthermore, we experimentally validated that only a small subset 
of the training data is sufficient to recover the initial system accuracy 
using hybrid training, which helps to minimize the hardware resources 
needed for fetching training data. A five-layer CNN (shown in Fig. 2a) 
was employed to demonstrate that only 10% of the training dataset is 
enough to regain the high recognition accuracy of the system. Similarly 
to the experimental procedure, the trained weights were first trans- 
ferred to the memristor PEs, and during the transfer some mapping 
errors were intentionally added by replacing 10% of the target weights 
with random values; accordingly, the recognition accuracy was reduced 
to 80.66%. Then 5,500 training images were randomly chosen from 
the total training dataset (that is, 10% of the 55,000 training images) 
to update the weights of the FC layer. After performing hybrid training 
as described in the main text, the accuracy was increased up to 94.40% 
after ten training epochs. To prove the robustness of our hybrid train- 
ing technique, the experiment was conducted two more times, and the 
result is shown in Extended Data Fig. 6c. 

Furthermore, a typical ResNET-56 model was used to validate that 
asmall subset of the total training dataset is adequate for recovering 
the high initial accuracy of the system using hybrid training. The ini- 
tial accuracy achieved using software was 95.57% (training with 32-bit 
single-precision floating-point weights), which was degraded to 89.64% 
after the quantization using 15-level weights. Subsequently, the quan- 
tized weights were mapped tothe memristor arrays with the established 
device model in the weight-transfer stage, and the recognition accuracy 
dropped accordingly to 79.76%. Afterwards, we evaluated the system 
accuracy after hybrid training using 3% of the total training dataset, 
thatis, 1,500 images from a total of 50,000 training samples. During the 
simulation, ten trials were made. The final result is plotted in Extended 
Data Fig. 6d, which depicts the recognition accuracy associated with 
the key phases of the whole simulation process. It was found that a 
small subset (3%) of the training data is enough to guarantee a high 
recognition accuracy (92%)—a 3.57% precision decline against the soft- 
ware result. This simulation result is consistent with the experimental 
results described above. 


The 15-level conductance weight 

A4-bit weight is generally sufficient to achieve a high recognition accu- 
racy for CNNs”"”>. In this work, an approximate 15-level fixed-point 
weight was adopted as the differential conductance of a pair of 8-level 
memristors. The smaller number of conductance states needed within 
the switching window leads to faster weight transfer because a larger 
target margin is permitted in the closed-loop writing. Writing an arbi- 
trary 15-level fixed-point number to a differential pair of memristors 
obviously calls for aconsistent ability to distinguish among eight con- 
ductance states in each device. In addition, such writing requires that 
these states be separated within the switching window over the same 
interval. During the corresponding experiment, the conductance was 
programmed from 2.5 pS (0.5 pA at a 0.2-V read pulse) up to 20 pS (4 WA 
at a0.2-V read pulse) with a constant step of 2.5 pS. The equivalent 
15-level weight of the memristor pair was thus referred to the 15 indi- 
vidual differential conductance values that were uniformly distributed 
from negative 17.5 pS (2.5 pS-20 pS) to positive 17.5 pS (20 pS-2.5 pS). 
Moreover, the effect of read disturbance on the 15-level conductance 
weights after applying 10° read pulses (0.2 V) isinvestigated in Extended 


Data Fig. 7. The experimental data from the array-level tests show that 
the read operations with the 0.2-V pulse do not disturb the conductance 
states markedly or systematically. 


Estimation of number of programming pulses and 
programming currents 

It is critical to assess the required number of programming pulses in 
the closed-loop programming system to benchmark the system per- 
formance. To estimate the number of programming pulses required 
to stably converge the memristor to a desired conductance state, we 
randomly selected 24 rows of ITIR memristor devices and programmed 
them to high conductance states, that is, >20 pS (4.0 pA at a0.2-V read 
pulse). Afterwards, we divided these devices to eight groups, each with 
three rows. These eight groups of memristors were correspondingly 
written to eight different conductance states, from 0.5 pA to 4.0 pA 
witha uniform interval of 0.5 pA under a read voltage of 0.2 V. The error 
margin was set as +100 nA for the eight states. Then the required pulse 
numbers were analysed statistically on the basis of the measured data, 
and they are shown in Extended Data Fig. 8a, b. 

Even though the test only provides a rough estimation onthe required 
number of programming pulses, it indicates that it strongly depends 
on the gap between the starting conductance and the desired state. 
The larger the gap is, the more pulses are needed. Besides, a higher 
programming resolution—for example, a greater number of required 
quantized conductance states within the switching window or asmaller 
desired error margin—would also require a larger number of pulses. 

In addition, writing currents are crucial for system design, especially 
for the calculation of the system energy. However, the programming 
currents cannot be deduced directly based on the reading currents 
and writing voltages owing to the nonlinear current-voltage curve. 
To estimate the programming currents accurately, we swept the d.c. 
voltage on a single 1T1R cell to measure the write current. 

The result is shown in Extended Data Fig. 8c, d. The SET current is 
around 60 pA at 1.5 Vand the RESET current is around 45 pA at -1.2 V. 
Both voltages are smaller than those measured during the pulse pro- 
gramming process in the array (that is, 2.0 V for SET pulse and -1.8 V 
for RESET pulse). This is because the 50-ns pulse width used for pulse 
programming is much shorter than the voltage duration inthe d.c. test. 


Evaluation of recognition accuracy 

Although we have successfully demonstrated the mCNN using paral- 
lel operations, the test system crashes easily for long running periods 
owing to unstable interface connections—for example, the UART inter- 
face between the upper computer and lower computer and the FMC 
connector between the ZC706 board and the customized PE board 
(Supplementary Information). Besides, the specific implementation of 
the test system—such as the quantity and speed of the commercial ADC 
chips—is not optimized for a high-performance design. To facilitate a 
reliable accuracy analysis within a stable connection period, in this 
study the conductance of each memristor in different PEs is written 
first. Then, the current of each memristor is sensed, and this value is 
consequently used to calculate the recognition accuracy using the 
ARM core of the test system. The computation process is similar to 
that realized by the hardware. 


Learning and tuning of FC weights 

During the second phase of hybrid training, we adopted in situ learn- 
ing to adjust the FC memristor weights. A stochastic gradient descent 
(SGD)" with a batch size of 100 was used. Even though this mini-batch 
SGD technique may require extra memory resources to store the inter- 
mediate results, it could increase the converging speed and mitigate 
the overfitting issue. In addition, the memory overhead could be mini- 
mized by using the proposed hybrid training method to update the FC 
weights only and eliminate the demand for storing the intermediate 
results of all convolutional layers. 


For a single iteration cycle, the 100 images drawn from the 55,000 
training images were fed into the mCNN and processed from the initial 
to the final output layer. Then, the gradients of the objective function 
(here, the cross-entropy loss function) with respect to the FC weighted- 
sum outputs were determined using the softmax probabilities and the 
associated true image labels. Later, the quantitative updates of the 
FC weights were calculated from the intermediate FC inputs and the 
gradients as follows: 


100 
AW=n YV;x8; (1) 
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Here, the learning rate nis aconstant; AW describes the desired updates 
of the weight matrix; V;is the intermediate 192-dimensional column vec- 
tor injected into the FC layer; 6, is the calculated ten-dimensional row 
vector representing the objective derivatives of the FC outputs; and i 
represents the image index in the batch of 100 images. The accumulated 
weight updates determine the conductance changes that are ultimately 
needed on the basis of the following threshold learning rule”: 
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where AW,,,,, represents the update cell at the cross point of rowmand 
column n in the weight-update matrix, and Th represents the prede- 
fined threshold value used to determine whether the corresponding 
memristor needs to be programmed. In this study, Th was equal to 1.5 pS 
(that is, 0.3 pA ata 0.2-V read pulse). This threshold learning rule tends 
to reduce the number of programming operations by filtering out the 
original tiny updates, and results in training acceleration and energy 
saving. Then, parallel programming of the FC memristors could be 
conducted row by row” to achieve the desired updates accordingly. 
The closed-loop writing method was introduced to circumvent the 
nonlinear and asymmetric conductance tuning issue, which could be 
addressed by exploring new basic weight units” and programming 
schemes’. Alternatively, if the device performance (for example, the 
linearity and symmetry) could be further improved, faithful in situ 
updating could be used with the SGD updating method. This could 
be more energy- and latency-efficient by encoding the residual error 
from the output side and the input data from the input side to the cor- 
responding programming pulses directly. 


Degradation of conductance weight with time 

In hybrid training, the kernel weights were programmed only during 
the weight-transfer stage. Thus, we set up this experiment by writing 
all the convolutional kernel weights onto two memristor PEs. After 
programming all the conductance weights smoothly, we read out these 
weights to assess how the conductance weights evolved within 30 days. 

Extended Data Fig. 3a illustrates how the differential conductance 
weights (represented by the current read at 0.2 V) drifted with time. The 
cluster of grey curves in Extended Data Fig. 3a includes the evolution 
traces of all the conductance weights, where each line represents one 
individual weight. In the foreground, three typical evolution traces of 
the conductance weights are highlighted to show the general trend. 
Because the conductance weights were quantized and programmed 
using 15 levels, we divided all the weight cells in Extended Data Fig. 3a 
tothese 15 different weight levels, and obtained the mean weight values 
for each level statistically, as shown in Extended Data Fig. 3b. It can be 
seen that the 15 levels are still accessible and there is no overlapping 
between adjacent levels over time. 

Extended Data Fig. 3a indicates that the majority of cells can still 
maintain the weights well, even though there are some tail cells exhibit- 
ing noticeable weight drift with time. However, these tail cells could 
degrade the system accuracy, which will be discussed in the next 
section. 


Hybrid training could be used to address the device reliability issue 
caused by conductance drifts to some extent, instead of adopting the 
expensive reprogramming strategy. However, the reliability of the 
multiple conductance states needs to be further investigated” and 
improved by device and material engineering®®. The performance of 
memristor-based neuromorphic systems would benefit considerably 
from the improvement of device reliability and other non-ideal device 
characteristics. 


Effect of conductance weight degradation on recognition 
accuracy 

By repeating the experiment described in Fig. 3, we investigated how 
the drifts of the conductance weights affect the system recognition 
accuracy after hybrid training. The inference accuracy and conduct- 
ance weights were recorded at 10, 30, 60, 90 and 120 min after hybrid 
training. 

Extended Data Fig. 3c illustrates how the system accuracy changes 
during the experiment. Similarly to Extended Data Fig. 3a, in Extended 
Data Fig. 3d we plot the state evolution curves of all the involved 
weights, including the convolutional kernels and the weights of the FC 
layer, and three typical lines. Most of the weight states are maintained 
well within the first 2 h after hybrid training; however, the conductance 
drifts of the tail cells lead to apparent accuracy degradation. 


Training process in parallel memristor convolvers 

After transferring the weights, three fetched batches of training images 
were passed into the three convolver copies separately. By applying the 
input signal as described in the previous section, we captured three 
independent batches of interim output of the S4 layer and organized 
them as the input to the FC layer in a pipeline fashion. The training 
scheme sets the constraint that a batch of intermediate outputs will 
not be supplied as input until the previous batch has been used to cal- 
culate the desired weight updates and the corresponding FC memristor 
conductances have been well tuned. The desired updates of the FC 
weights with respect to the first input batch were calculated according 
to equation (1), and the relevant memristor conductances were modu- 
lated following the threshold learning rule of equation (2). Then, the 
FC conductances were updated after inputting the second input batch 
based on the well tuned FC weights of the previous phase. Afterwards, 
the third batch was used to tune the FC conductance weights sequen- 
tially. During this updating stage, another three batches were drawn 
from the training database and fed into the unoccupied memristor 
convolvers in parallel. These operations were repeated until the system 
converged to a stable recognition accuracy. 


Benchmarking of system metrics 

We evaluated the hardware performance of the memristor-based neu- 
romorphic computing system using the experimental data. Based on 
the calculation results, we can conclude that the system can achieve 
110 times better energy efficiency and 30 times better performance 
density compared with Tesla V100 GPU. 

To benchmark the performance of the memristor-based neuromor- 
phic computing system, we propose a neural processing unit architec- 
ture (shown in Extended Data Fig. 5) corresponding to the structure 
in Fig. la. It consists of multiple memristor tiles and each tile contains 
four memristor cores. The memristor core comprises one 128 x 128 
memristor array and all the essential peripheral circuits, including 
drivers, multiplexer (MUX), MUX controller, sample-and-hold blocks 
and ADCs. Using the macro core, the typical energy efficiency and 
performance density are assessed by combining the experimental 
data (measured from the fabricated memristors at a130-nm technol- 
ogy node) and simulation data obtained with the simulator XPEsim”. 

Inthe memristor macro core, we maximize the computing parallel- 
ism by connecting two sample-and-hold blocks (S & H groups land 2in 
Extended Data Fig. 5) to each column of the array. Every four columns 
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share one common ADC converter to save chip area and power. When 
applying a1-bit read pulse to all rows at 20 MHz, one of theS & H groups 
is turned on and is connected to the source lines in parallel to convert 
the accumulated current to voltage. During the next 1-bit read period, 
the S & H blocks are redirected to another S & H group. Meanwhile, at 
the beginning of this read phase, the stable voltage signals of the previ- 
ous S & Hgroup are passed to the ADC block through the control of the 
MUxX-based data path, where every four stable voltages are converted 
inturntoa digital signal by the shared ADC at the MUX. The 8-bit ADC 
completes four conversations during the 1-bit inference stage and 
consumes 2.55 pJ of energy for each conversion. In this manner, there 
is no idle period for the ADCs and the input pulses are continuously 
fed into the array. 

The detailed metrics, including the energy, latency and area of each 
block, are listed in Extended Data Table 1, which indicates the system 
performance for an input of a1-bit read pulse (0.2 V, 50 ns). In Extended 
Data Table 1, the memristor-related metrics are evaluated with the 
measured memristor (130 nm technology node) characteristics. The 
parameters associated with the other peripheral circuitry blocks are 
extracted using the simulated circuits at the 65-nm technology node, 
except for the S & H block” and the 8-bit ADC block*. When inferenc- 
ing with a 0.2-V, 50-ns read pulse and considering all the 32 ADCs and 
other circuitry blocks, the energy consumption for 1-bit computing is 
371.89 pJ, and the total occupied chip area is 63,801.94 j1m?/90.69% = 
0.0704 mm’ (the area efficiency is 90.69% for the layout of the macro 
blocks). Hence, we can assess the metrics of the memristor-based 
neuromorphic computing system when inputting an 8-bit integer by 
evaluating the performance, power, area, energy efficiency and per- 
formance density, as shown in Extended Data Table 2. 

From the above calculations, we obtained an energy efficiency of 
11,014 GOP s! W ‘and a performance density of 1,164 GOP s'mm~. 
Compared with the metrics of Tesla V100 GPU” (that is, energy effi- 
ciency of 100 GOP s!W and performance density of 37 GOP s mm 
for 16-bit floating-point number computing), the memristor-based 
neuromorphic computing system shows roughly 110 times better 
energy efficiency and 30 times better performance density. It should 
be mentioned that some necessary functional blocks—suchas the pool- 
ing function, the activation function, and the routeing and buffering of 
data between different neural-network layers—were not considered in 
the comparison. Our system performance could be further improved by 
using more advanced technology nodes and optimizing the computing 
architecture and peripheral circuits. 

Furthermore, in Extended Data Table 1, we break down the power 
consumption of each circuitry block in the macro core during the 
1-bit inference period. The ADC accounts for 14.4 times the power of 
the memristor array; however, this number is expected to decrease 
when the memristor array size increases. For example, the ADC blocks 
(52 mW) would only account for 1.8 times the power of a 1,024 x 1,024 
memristor array (29 mW). This result suggests that both the array size 
andthe ADC optimization should be carefully considered to achieve the 
best computing efficiency of memristor-based neuromorphic systems. 


Scalability demonstration using the ResNET model and the 
CIFAR-10 database 

Replicating the same kernels to different memristor arrays is a crucial 
approachto improving the efficiency of memristor-based convolvers. 
This replicating method could mitigate the speed mismatch between 
the convolutional layer and the FC layer, and overcome the difference in 
the amount of convolution operations among different convolutional 
layers. In practice, we can duplicate a certain part of the kernels to real- 
ize efficient acceleration. For example, the first convolutional layer ina 
CNNnormally contributes the greatest number of sliding convolutional 
operations because it has the largest input size; therefore, it causes 
the largest convolutional computing latency compared with the other 
layers. In the respect, it is reasonable to replicate only the kernels on 


the first convolutional layer. Further studies are required to optimize 
the replicating strategy in the architecture design to yield the desired 
system performance. 

To validate that the approach of replicating the same kernel to dif- 
ferent memristor replicas, combined with the hybrid training method, 
is scalable to larger networks in the presence of intrinsic device vari- 
ability, a standard residual neural network, ResNET-56, was tested on 
the CIFAR-10* database. We used the compact model incorporating 
the device variability to simulate the real device performance. Taking 
the programing error into consideration, a Gaussian distribution was 
employed to model it as 6 [nA] ~ N(O nA, 108 nA). Here dis the program- 
ming error compared to the target conductance, and M(u, 0) denotesa 
Gaussian distribution with mean value yw and standard deviation o. The 
statistical parameters were extracted from the measurement results. 
During the simulation, the memristor was programmed at eight dif- 
ferent levels during the weight-transfer stage. We then realized the 
equivalent 15-level weight by the differential technique as in the experi- 
ments. The kernels in the first convolutional layer were replicated into 
four copies of memristor arrays in the ResNET-56 model. Theoretically, 
this ResNET-56 model requires 782 memristor arrays with a size of 
144 x 16 to implement all the weights. 

The initial accuracy achieved by the software was 95.57%, which 
was degraded to 89.64% after the quantization of the 15-level weights. 
Subsequently, the quantized weights were mapped to the memris- 
tor arrays in the weight-transfer stage, and the recognition accuracy 
further decreased to 80.06%. However, after the in situ training of the 
FC layer, the accuracy ultimately recovered to 94.08%—thatis, a slight 
degradation of 1.49% compared with the baseline of 95.57%, as shown 
in Extended Data Fig. 6a. Extended Data Fig. 6b presents the error rates 
for the replicated G1, G2, G3 and G4 groups, which decreased fromthe 
initial values of 20.24%, 19.83%, 19.58% and 19.84% to 6.11%, 5.84%, 5.87% 
and 6.34%, respectively. 


Data availability 

The datasets that we used for benchmarking are publicly available 
The training methods are provided in refs. °*°. The experimental setups 
are detailed inthe text. Other data that support the findings of this study 
are available from the corresponding author upon reasonable request. 
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Code availability 


The simulator XPEsim used here is publicly available”. The codes used 
for the simulations described in Methods are available from the cor- 
responding author upon reasonable request. 
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Extended Data Fig. 1| Cumulative probability distribution of memristor 
conductance for the remaining seven 2,048-cell arrays. The red circles 
highlight abnormal data points, which deviate from their target conductance 
ranges owing to device variations. a, The 32-level conductance distribution on 
an entire 2,048-cell array. b-g, Conductance distributions for the first 32 rows 
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of memristors (namely, 512 devices) for each of the remaining 2,048-cell arrays. 
Asmall number of writing errors were observed during the programming 
procedure (red circles), which could be attributed to device variations. These 
results show good consistency with Fig. Ic. 
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Extended Data Fig. 3 | Drift of conductance weights in time and associated 
degradation in system accuracy. a, Changes in the conductance weights with 
time, over 30 days after the transfer. The grey lines present the changing traces 
of all the cell weights, and the three coloured lines depict representative 
evolution trends. b, Mean weight value for the cells that belong to each of the15 
levels according toa. The 15 coloured traces show the 15 mean-value evolution 
traces asa function of time. c, Profile of accuracy loss during the experiment. 
The overall trend of the accuracy loss indicates how the conductance weight 
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drifts deteriorate the recognition accuracy over time after hybrid training. 
Compared with the initial state, the recognition accuracy increases by 0.37% at 
t=10 min, owing to random device-state fluctuations. d, Evolution of the 
weights of the weight cells considered inc over 2h. tf) denotes the moment 
when the hybrid training is completed. The grey lines show the changing traces 
of the states of the cells, and the three coloured lines depict representative 
evolutiontrends. 
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rates with respect to the G1, G2 and G3 groups decrease from 4.82%, 6.43% and 
5.85% to 2.89%, 4.22% and 3.40%, respectively, after hybrid training. b, 
Simulation results for all combinations of tuning weights for different layers 
using hybrid training and the five-layer CNN. 


Article 


Memristor based Neural processing units 


Memristor based Computing Macro Core 
Viead (0.2 V) BL Dri : 
BL[7] BL[1]BL(0] _ 128x128 Memristor Array WL Driver 
SLL |} rL> 


BL[7] BL[1]BL{0] 


ey 
Ms 


A] 
ra 


X) 


STL 


BL[7] BL[1]BL{0] 


(. >} —— +} 
ma 
5 ii 


JIL ie 
rez 
rot 
8 pulses 
@ 20 MHz 


SL Driver F: Les dy sh 
fk keh 4 
mr 
I 7 


Vie, WY. Y 

ma ja aA a 
Eran Sr 

Ss _——————— 


eo eh 
no Pan 
= ay 


x) XX 
Switch signal —> = a 


Sample & Hold 


i MO KOR 
A DA BS oA DYN LYN 
rg 


Controller 


<< Switch signal 


Shift & Adder 
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using each convolver group. c, The error rates drop considerably after hybrid 
training using 10% of the training data in the experiment with the five-layer 
CNN.The three experimental results show good consistency. d, Recognition 
accuracies at different stages of the simulation with ResNET-56. A high level of 
accuracy is achieved even when using 3% of the training data (1,500 training 
images) to update the weights of the FC layer. The mean accuracy for 10 trials is 
92.00% after hybrid training, and the standard deviationis 0.8%. 
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Extended Data Table 1 | Detailed metrics of each circuitry module in the macro core with 1-bit input 


Module Area/um/? Latency/ns Energy/pJ 

Array 1107.56 - 22.59 
BL Driver 2104.36 0.002 1.99 
WL Driver 2104.36 0.008 | 1.99 
SL Driver 1686.3 0.38 0.11 
Sample & Hold 10 - 0.13 
Mux 1741.9 0.008 2.37 
Mux Decoder 69.84 6.10 0.05 

ADC 48000 - 326.4 

Shift & Adder 6977.62 0.19 16.26 

Sum 63801.94 | - 371.89 


BL, bit line; WL, word line; SL, source line. 


Extended Data Table 2 | Benchmark metrics of a single macro core with 8-bit input 


Performance (128 x 128 x 2) ops / (8 x 50 ns) = 81.92 GOP s"! 
Power 371.89 pJ / 50 ns = 7.438 mW 
Area 0.0704 mm? 
Energy efficiency 81.92 GOP s!/ 7.438 mW = 11,014 GOP s! W"! 
Performance density 81.92 GOP s! / 0.0704 mm? = 1,164 GOP s! mm? 


ops, operations. 
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Most bulk-scale graphene is produced by a top-down approach, exfoliating graphite, 
which often requires large amounts of solvent with high-energy mixing, shearing, 
sonication or electrochemical treatment’ °. Although chemical oxidation of graphite 
to graphene oxide promotes exfoliation, it requires harsh oxidants and leaves the 
graphene with a defective perforated structure after the subsequent reduction step 
Bottom-up synthesis of high-quality graphene is often restricted to ultrasmall 
amounts if performed by chemical vapour deposition or advanced synthetic organic 
methods, or it provides a defect-ridden structure if carried out in bulk solution‘ ®. 
Here we show that flash Joule heating of inexpensive carbon sources—such as coal, 
petroleum coke, biochar, carbon black, discarded food, rubber tyres and mixed 
plastic waste—can afford gram-scale quantities of graphene in less than one second. 
The product, named flash graphene (FG) after the process used to produce it, shows 
turbostratic arrangement (thatis, little order) between the stacked graphene layers. 
FG synthesis uses no furnace and no solvents or reactive gases. Yields depend on the 
carbon content of the source; when using a high-carbon source, such as carbon black, 
anthracitic coal or calcined coke, yields can range from 80 to 90 per cent with carbon 
purity greater than 99 per cent. No purification steps are necessary. Raman 
spectroscopy analysis shows a low-intensity or absent D band for FG, indicating that 
FG has among the lowest defect concentrations reported so far for graphene, and 
confirms the turbostratic stacking of FG, which is clearly distinguished from 
turbostratic graphite. The disordered orientation of FG layers facilitates its rapid 
exfoliation upon mixing during composite formation. The electric energy cost for FG 
synthesis is only about 7.2 kilojoules per gram, which could render FG suitable for use 
in bulk composites of plastic, metals, plywood, concrete and other building materials. 


3,4 


Inthe flashJoule heating (FJH) process, amorphous conductive carbon 
powder is lightly compressed inside a quartz or ceramic tube between 
two electrodes (Fig. la, Supplementary Fig. 1). The system can be at 
atmospheric pressure, or under a mild vacuum (-10 mm Hg) to facilitate 
outgassing. The electrodes can be copper, graphite or any conduc- 
tive refractory material, and they fit loosely into the quartz tube to 
permit outgassing upon FJH. High-voltage electric discharge froma 
capacitor bank brings the carbon source to temperatures higher than 
3,000 K in less than 100 ms, effectively converting the amorphous 
carbon into turbostratic FG. In high-resolution transmission electron 
microscopy (HR-TEM) analysis (Fig. 1b, c), the misoriented layers of FG 
exhibit the expected Moiré patterns, whereas FG derived from spent 
coffee grounds also shows hexagonal single-layer graphene (Fig. 1d). 

High-quality graphene can be quickly identified by Raman spec- 
troscopy’ 1°. FG from carbon black (CB-FG) has an intense 2D peak. As 


seen inthe Raman mapping of CB-FG in Fig. le, the intensity of the 2D 
band relative to the G band (/,p,,) is greater than 10 in many locations. 
The extremely low intensity of the D band indicates the low defect 
concentration of these FG products, which contributes to the ampli- 
fication of the 2D band. Thus, the unusually high /5p,, = 17 (Fig. 1e) of 
CB-FG is the highest value reported so far for any form of graphene, 
and is probably an outcome of the extreme temperature reached inthe 
flash process, which outgasses non-carbon elements fromthe system. 
Additionally, the two peaks TS, and TS, at -1,886 cm“ and -2,031cm"“, 
respectively, confirm the turbostratic nature of FG (Supplementary 
Figs. 2,3), which is discussed extensively in Supplementary Informa- 
tion and Supplementary Table 1”. 

The X-ray diffraction (XRD) pattern of FG shows a well defined 
(002) peak indicating successful graphitization of the amorphous 
carbon. The (002) peak of FG occurs at diffraction angle 20 = 26.1°, 


Applied Physics Program, Rice University, Houston, TX, USA. 7Department of Chemistry, Rice University, Houston, TX, USA. *Department of Materials Science and NanoEngineering, Rice 
University, Houston, TX, USA. “C-Crete Technologies, Stafford, TX, USA. ‘Department of Civil and Environmental Engineering, Rice University, Houston, TX, USA. °Smalley-Curl Institute and the 
NanoCarbon Center, Rice University, Houston, TX, USA. *e-mail: rouzbeh@ccretetech.com; biy@rice.edu; tour@rice.edu 


Nature | Vol577 | 30 January 2020 | 647 


Article 


3,000 
2,800 


0.000 0.005 0.010 
Time (s) 


Temperature 


Quartz tube 


e Raman looyg Map XRD TEM 
7 Best 14.8 : : 
gx 3 
ag 2 
5 2 
aS ¢£ 
= Oo 
56> 2 
0.4 
1,000 2,000 3,000 20 40 60 80 
Best 10.0 
o 
SENS eo 
28 a 
go Mean - ( 
oO : po 
0.3 eo 
1,000 2,000 3,000 20 40 60 80 
Best 
g 
8 
B° 
£5 Mean 
So 
a> 
1,000 2,000 3,000 20 40 60 80 
= Best 13.0 
& y 
ite} 
o& 
OX 
5 0 Mean 
O° 
xe) 
S 0.3 
1,000 2,000 3,000 20 40 60 80 | 


Fig. 1|FGsynthesized from various carbon sources. a, Schematic of the FJH 
process, and plot of the temperature rise versus time during flashing (inset). 
b-d, HR-TEMimage of CB-FG on top ofa single layer of coffee-derived FG. 

e, Characterization results, including Raman spectra (showing the best and 
the mean obtained spectra), XRD spectra and TEM images for FG derived from 
various carbon sources. The coffee-derived FG is made from used coffee 
grounds; the smaller graphene particles within large graphene sheets come 


which corresponds to an interlayer spacing of /,=3.45 A. This spacing 
is larger than that in a typical Bernal (AB-stacked) graphite, 3.37 A, 
indicating the expanded and turbostratic structure of FG. The (002) 
peak was found to be unsymmetric, with a tail at small angles, which 
further suggests the turbostratic nature of FG”. The flash process 
is fast enough to prevent AB stacking. CB-FG has a surface area of 
~295 m? g with pore size <9 nm, as measured by Brunauer-Emmett- 
Teller analysis (Supplementary Fig. 4). Calcined petroleum coke 
(CPC) also works well for conversion to CPC-FG (Fig. le, Supplemen- 
tary Table 2) which has a similar nanostructure to that of CB-FG. 
Together with carbon black, CPC is listed as anon-graphitized carbon 
source (Supplementary Table 3)". The average size of CB-FG and CPC- 
FGis -13 nm and -17 nm, respectively (Supplementary Figs. 5, 6). The 
yield of the FJH process is as high as 80% to 90% from high-carbon 
sources such as carbon black, calcine coke or anthracite coal, and 
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from the carbon black conductive additive. Each pixel inthe Raman mappingis 
4m? using a50x magnification. All Raman samples were prepared from the 
powdered product after FJH; the samples were not exposed to the solvent 
before Raman analysis. Coffee is about 40% carbon, so the yield based onthe 
starting carbon content is ~85%. The sample size for the mean Raman spectrum 
is10. 


the electric energy needed for their conversion is ~7.2 kJ g' (Sup- 
plementary Information). 

In the case of coffee grounds, the used grounds were mixed with 
5wt% carbon black to increase its conductivity—alternatively, we could 
use 2-5 wt% FG froma previous run as the conductive additive. Coffee 
grounds, being predominantly carbohydrate, are ~40% carbon. Hence, 
the yield of graphene of -35% (Fig. le) would be ~85% conversion of the 
coffee carbon content into graphene, whereas the heteroatoms sublime 
out at these reaction temperatures (>3,000 K). Anthracite can be suf- 
ficiently conductive to be used in the FJH reactor, but better results were 
obtained by adding 5 wt% carbon black. Although a black FG powder 
is produced regardless of the starting material, FG from graphitizing 
carbons—suchas from used coffee grounds (C-FG) and anthracite coal 
(A-FG) (see Supplementary Information for definitions of graphitizing 
and non-graphitizing carbons; see also Supplementary Table 3)—has 
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Fig. 2 | FJH critical parameters. a, Raman spectra of CB-FG with increasing 
flashing voltage (top to bottom). b, /,),,and /p,, ratios of CB-FG at different 
flashing voltages. The bars represent 1s.d. (n=10).c, Time-temperature graph 
of CB-FG reacted at different temperatures. The temperature is regulated by 
the flashing voltage. d, Time-temperature graph of CB-FG reacted at different 
flashing durations. The flashing duration is regulated by the sample 
compression between the electrodes, which affects the sample conductivity. 


different morphologies from CB-FG. Graphitizing carbons produce 
larger graphene sheets (Supplementary Fig. 7). XRD analysis of C-FG 
shows, in addition to the dominant (002) peak at 26.0°, a sharp (100) 
peak at 20 = 42.5°, which is associated with the in-plane interatomic 
spacing (Fig. le). The narrow full width at half-maximum of the (100) 
peak suggests larger in-plane sheet sizes relative to FG formed from 
some of the other starting materials. HR-TEM reveals folded graphene 
sheets in A-FG and C-FG (Fig. le) with an average size of 0.5 pm and 
1.2 um (Supplementary Fig. 7), respectively, similar to the size of gra- 
phene sheet obtained by exfoliation of graphite*>"*. Selected-area elec- 
tron diffraction measurements on these samples show both monolayer 
and turbostratic graphene (Supplementary Figs. 8-10). 

Other carbons that are abundant, renewable or waste-sourced can 
be used, such as charcoal, biochar, humic acid, keratin (human hair), 
lignin, sucrose, starch, pine bark, olive oil soot, cabbage, coconut, 
pistachio shells, potato skins, rubber tyres and mixed plastic (Sup- 
plementary Fig. 11, Supplementary Table 4), including polyethylene 
terephthalate (PET or PETE), high- or low-density polyethylene, poly- 
vinyl chloride, polypropylene and polyacrylonitrile. When converting 
synthetic polymers into FG, the non-carbon atoms sublime out as small 
molecules, leading to a very-high-carbon-content product, as shown 
here. However, polymer and rubber depolymerization can also ensue 
to afford oligomers that sublime before conversion; therefore, it is 
more economical to use a pyrolysis product where the volatiles are first 
industrially removed for fuel sources” and the residual carbon is con- 
verted into FG. This was demonstrated here with rubber-tyre-derived 
carbon black (Supplementary Fig. 11,Supplementary Tables 2, 4). None 
of these FG processes was optimized. Optimization was performed 
only onCB-FG, as described below. The FJH process can provide a facile 
route to convert these waste products into FG, a potential high-value 
building-composite additive!®. 
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The numbers within the plot represent cooling rates. e, Raman spectra of 
CB-FGat different compression ratios. Higher compression provides lower 
resistance to the sample. f, Raman spectra of the CB-FG samples showninc. 
g, Raman spectra of the CB-FG samples shown ind. The 150-ms pulses land 2 
have similar duration but different cooling rates, as shownind. All Raman 
spectra in the figure were taken at low magnification (5x) to obtainamean 
spectrum of the sample from 10 spectra. 


The graphene/,p,,is optimized by adjusting the sample compression 
between the electrodes (which affects sample conductivity), the capaci- 
tor voltage and the switching duration to control the temperature and 
duration of the flash (Fig. 2a—g). Increasing the voltage will increase the 
temperature of the process. The temperature is estimated by fitting 
the black-body radiation spectrum in the 600-1,100 nm emission (Sup- 
plementary Fig. 12). We investigated the quality of CB-FG using Raman 
spectroscopy at low magnification (See Methods) by varying the time 
and temperature in the FG synthesis process. At <90 V and <3,000 K, 
FG has a high D peak, indicating a defective structure (Fig. 2a—c, f). By 
increasing the voltage output, CB-FGis formed at 3,100 K, which has a 
low number of defects and almost no D band in the Raman spectrum. 
Therefore, 3,000 K is a critical temperature for producing higher- 
quality graphene with a larger /,p,, value. 

By increasing the compression on the sample between the two 
electrodes, the conductivity of the carbon source increases, thus 
decreasing the discharge time (Fig. 2d, e, g). While maintaining the 
flash temperature between experimental runs at ~3,100 K, a short 
flash duration of 10 ms results in a higher 2D band, whereas a flash of 
50-150 ms results ina lower 2D band product (Fig. 2g). This indicates 
that, given more time, the graphene flakes stack, orient and form more 
layers, lowering the 2D band of the resulting FG. A low cooling rate 
increases the flash duration and decreases the 2D band”. Therefore, to 
obtain a high/,p,,,a thin quartz tube is chosen to accelerate the radia- 
tive cooling rate. Interestingly, although the internal temperatures 
exceed 3,000 K, the external walls of the quartz tubes are only warm 
to the touch (<60 °C) after the flash process. Most of the heat exits as 
black-body radiation. 

X-ray photoelectron spectroscopy analysis shows a considerable 
reduction of elements other than carbon in FG and increases in the 
sp’ carbon bond content (Supplementary Figs. 13, 14). Carbon has a 
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Fig. 3 | Molecular dynamics simulations. Structures with various 
characteristics (such as micro-porosity, misalignment and size of graphitic 
domains) kept at a given temperature range (1,500 to5,000K) forupto5x10°s 
with aNosé-Hoover thermostat. a—c, Sample structures for carbon materials of 
density 0.8 gcm*™ (a; sponge-like structure), 1.1g cm™ (b),and1.5gcm°(c;high 
degree of graphitization) after annealing at 3,000 K.d, Carbon black with 


high sublimation temperature of ~3,900 K; other elements such as 
aluminium or silicon volatilize out at <3,000 K. 

Thermogravimetric analysis in air shows that FG products are more 
oxidatively stable than the materials from which they are derived (Sup- 
plementary Fig. 15) and more stable than reduced graphene oxide 
obtained with Hummer’s method”. Insome cases, silicon oxide residues 
are detected, which come from worn out quartz tubes after multiples 
uses. 

Previous studies have shown that graphene can be synthesized 
without catalysts at extremely high temperatures” *°. However, when 
FG is optimized as shown here, it can have exceptionally high quality 
when the reaction time and temperature are controlled. Furthermore, 
the electric current can facilitate the crystallization of graphene”. 
Degassing of hydrogen, nitrogen and oxygen during the FJH process 
might contribute to the formation of large and thin graphene sheets 
in coffee-derived FG because it could prevent stacking of graphene 
layers, thereby permitting further growth?>”°”?, 

To assess the mechanism of the rapid FG growth, we employ large- 
scale simulations with the AIREBO*° interatomic potential as imple- 
mented in the LAMMPS package (see Methods)”. Some of the acquired 
structures are shown in Fig. 3a—d. The low-density materials yield a 
sponge-like structure (Fig. 3a) during annealing, whereas increased 
density leads to a high level of graphitization (Fig. 3c). We note the 
high level of graphitization in the low-density CB sample, where the 
substantially increased local density is combined with high macro- 
porosity (Fig. 3d). Additionally, the annealing process is quantified 
by the sp?/sp’ ratio during simulation (Fig. 3e, f). We find that the gra- 
phene formation process is strongly impaired at lower temperatures 
(<2,000 K) but greatly accelerated at higher temperature (5,000 K) 
(Fig. 3g)—a trend that is also suggested by experiments (Fig. 2f). Inthe 
case of carbon black, continuous defect healing during FJH results in 
the gradual conversion of initially roughly spherical centroid particles 
into polyhedral shapes (Fig. 3d) that appear as fringes at clearly defined 
angles in TEM images (see Fig. 1b, e), further confirming the low-defect 
nature of the produced materials. 

The FJH process was scaled up by increasing the quartz tube size. With 
quartz tubes of 4mm, 8 mmand15 mm diameter, 30 mg, 120 mg and1g 
of FG can be synthesized per batch, respectively. Figure 4a shows the 
amount of CB-FG obtained with the three tube sizes. The shorter flash 
from the smaller tube results in FG with a higher /,p/,. To increase the 
batch size while maintaining FG quality, flat tubes are helpful because 
they enable a higher cooling rate (Fig. 4a). For industrial production, we 
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density 0.8 gcm™°and large macro-porosity, after prolonged (5 x 10” s) 
annealing at 3,600 K; polygonal fringes are apparent. e-f, Change of the 
structural composition of materials during annealing for materials of different 
densities p (e) and for annealing at different temperatures 7 (f).g, Structure of 
material with density 1.5 g cm” after annealing at 5,000 K; the initial structure is 
thesameas that showninc. Allscale bars are1.5nm. 


envision that the process can be automated for continuous FG synthesis 
(Supplementary Fig. 16). 

FG was found to be dispersible in water/surfactant (Pluronic F-127) 
to give highly concentrated dispersions reaching 4 g I (Fig. 4b, Sup- 
plementary Fig. 17). Using organic solvents, FG has a high degree of 
dispersibility (Fig. 4c) *,which can be attributed to the turbostratic 
arrangement permitting efficient exfoliation; the interlayer attraction 
forces are much lower than in conventionally arranged AB-stacked 
graphene obtained by graphite exfoliation. 

FG composites were explored, revealing considerably enhanced 
physical properties at small FG loadings. CB-FG-cement composites 
with 0.05% FG and cured for 28 days had ~25% higher compressive 
strength than the FG-free control sample (Supplementary Fig. 18). 
This enhancement in the compressive strength is three times higher 
than the values reported recently for cement composites reinforced 
by electrochemically exfoliated graphene with the same graphene 
loading, and slightly larger than those of other cement-graphene 
composites**””. The seven-day compressive and tensile strength of 
CB-FG-cement composites with 0.1% FG loading are ~35% and -19% 
higher, respectively, than those of the FG-free control sample (Fig. 4d). 
These enhancements are almost three times larger than those of 
other reported graphene-cement composites with the same load- 
ing, demonstrating rapid strength development. Scanning electron 
microscopy images of CB-FG-cement composites (Supplementary 
Fig. 19) show a homogeneous distribution of FGin the cement matrix. 
The largely enhanced properties and rapid strength development of 
CB-FG-cement composites is again attributed to the high dispers- 
ibility of the turbostratic CB-FG, which results in greater homogeneity 
and robust composites (see Supplementary Information for further 
explanations). 

In addition, CB-FG effectively enhances polymer properties. A 
0.1wt% CB-FG-polydimethylsiloxane (PDMS) composite showed ~250% 
increase in compressive strength compared with PDMS without gra- 
phene (Supplementary Fig. 20). 

To demonstrate its applicability in electrochemical energy storage 
devices, C-FG and CPC- FG were also used as electrode materials ina 
Li-ion capacitor and a Li-ion battery (Supplementary Fig. 21), dem- 
onstrating the potential to use FG in advanced energy applications. 

Insummary, alow-energy bottom-up synthesis of easily exfoliated tur- 
bostratic graphene was demonstrated from ultralow-cost carbon sources 
(suchas coal and petroleum coke), renewable resources (suchas biochar 
and rubber tyres) and mixed-waste products (including plastic bottles 
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Fig. 4| Scaling up and applications of CB-FG. a, FJH quartz tubes of different 
sizes and shapes, used to synthesize FG. Two separate synthesis processes were 
conducted with each tube, providing the samples in the tube and those inthe 
plastic dishes. b, FG dispersion in a water-Pluronic (F-127) solution (1%). The 
photograph shows the supernatants of 4g!“ of CB-FGand of 10 gl‘ ofa 


and discarded food). Scaling up of the FG synthesis process could pro- 
vide turbostratic graphene for bulk construction composite materials. 
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Methods 


FJHsystem 

The FJHset up is detailed in Supplementary Fig. 1. Inside a quartz tube, 
two loosely fitting electrodes compress the carbon source using two 
copper-wool plugs or graphite spacers to contact the carbon sources 
to allow degassing of volatile materials. The compressing force is 
controllable by a modified small vice so as to minimize the sample 
resistance to 1-1,000 Q, and is key to obtaining a good flash reaction 
(0.004-4S cm”). To control the discharge time, a mechanical relay with 
programmable millisecond-level delay time is used. The entire sample 
reaction chamber is placed inside a low-pressure container (plastic 
vacuum desiccator) for safety and to facilitate degassing. However, the 
FJH process works equally well at 1 atm. The capacitor bank consists 
of 20 capacitors with a total capacitance of 0.22 F. Each capacitor is 
connected to the main power cable (or bus) by a circuit breaker that is 
also used as a switch to enable/disable each capacitor. The capacitor 
bank is charged by a d.c. supply capable of reaching 400 V. The first 
prototype system is placed conveniently on one plastic mobile cart 
(Supplementary Fig. 1b). Using a large 15-mm-diameter quartz tube, 
we achieve synthesis of 1 g of FG per batch using the FJH process. 

Safety notice: the capacitor bank is capable of generating fatal elec- 
tric pulses. Therefore, the following steps are taken to protect the oper- 
ator as wellas the circuit, and we strongly suggest that these measures 
be followed. Details of the circuit can be found in Supplementary Fig. la. 
Darkened safety glasses should be worn to protect eyes from the bright 
light during the discharge flashing process. 

The voltage and current ratings for the circuit breaker are appropriate 
for the maximum voltage and the anticipated maximum current that 
will be supplied by each capacitor to the FJH discharge on the basis of 
a discharge time of 50-200 ms. We use the maximum charging and 
bleeding voltages at ~400 V with maximum currents of 0.7 Aand 0.1A, 
respectively. The pulse discharging voltage to the sample is ~-400 V and 
current can reach up to 1,000 Ain <100 ms. A 24-mH inductor is used 
to avoid current spikes while using the mechanical relay. Without the 
inductor, the mechanical relay could be prone to high-current arcing 
during the intermittent closing of the circuit. To protect the inductor 
from the spike voltage when shutting off the current, a diode and alow- 
resistance resistor with appropriate ratings are connected parallel to 
the inductor. Additionally, to protect the capacitor from reverse polar- 
ity incase of oscillatory decay (whichcan occur ina fast discharge), an 
appropriate diode is placed parallel to the capacitor bank. 


Characterization 

The resultant FG products were characterized by scanning electron 
microscopy (SEM) using an FEI Helios NanoLab 660 DualBeam SEM 
system at 5 kV with a working distance of 10 mm. X-ray photoelectron 
spectroscopy (XPS) data were collected with a PHI Quantera SXM 
Scanning X-ray Microprobe with a base pressure of 5 x 10° torr. Sur- 
vey spectra were recorded using 0.5-eV steps with a pass energy of 
140 eV. Elemental spectra were recorded using 0.1-eV steps with a pass 
energy of 26 eV. All of the XPS spectra were corrected using the C Is 
peak (284.5 eV) as reference. 

TEM images were taken with aJEOL 2100F field-emission gun TEM at 
200 kV. Atomic-resolution HR-TEM images were taken with an FEI Titan 
Themis S/TEM system at 80 keV. Samples were prepared by dropping 
diluted dispersions (-1 mg ml‘ in isopropanol) of the graphene sample 
(<200 pl) on the TEM Cu grids. The dispersion was prepared using a 
bath sonicator (~15 min). Electron diffraction was calibrated by a dif- 
fraction standard (evaporated Al grid; Ted Pella). 

All Raman spectra were collected with as-prepared FG samples atopa 
glass slide, before exposure to solvent, using a Renishaw Raman micro- 
scope and a 532-nm laser with a power of 5 mW. A 50x lens was used 
for the local Raman spectra in Fig. 1 and a 5x lens for the mean Raman 
spectra in Fig. 2. 


Atomistic modelling 

Atomistic simulations were carried out using periodic bound- 
ary conditions with -15,000 atoms per unit cell for all structures 
except the carbon black model, which contained ~55,000 atoms. 
The initial configurations were created by random positioning and 
misorientation of small graphitic flakes of arbitrary shape and up to 
8-12 Ain diameter, and subsequently adding randomly positioned 
individual carbon atoms (the atomic carbon content was ~50% to 
represent anon-graphitized portion of the source material). Car- 
bon black centroid particles were created by arranging randomly 
oriented graphitic flakes in roughly spherical shapes with hollow 
cores and diameters of up to 12 nm, and adding atomic carbon 
(~50%). The initial configurations were subjected to preliminary 
annealing at 400 K for 2 x 10°’ s to eliminate irregularities caused by 
the structure creation protocol, then heated to the target annealing 
temperature with a heating speed of 0.5 x 10°” K s ‘using a Nose- 
Hoover thermostat (canonical NVT ensemble) with a temperature 
damping parameter of 0.025 x 10°” s. The structures were held 
at the target annealing temperatures for 5x 10° s (15 x 10°’ s for 
carbon black). 


Preparation of flash graphene dispersion in water—Pluronic 
solution 

FG was dispersed in water-Pluronic (F-127) solution (1%) at con- 
centrations of 1-10 gl". The mixture was sonicated in an ultrasonic 
bath for 40 min to obtain a dark dispersion. The dispersion was 
subjected to centrifugation at 1,500 rpm (470 relative centrifugal 
force) for 30 min to remove aggregates using a Beckman Coulter 
Allegra X-12 centrifuge equipped with a 19-cm-radius rotor. The 
supernatant was analysed via ultraviolet-visible spectroscopy 
(Shimadzu). The dispersions were diluted 500 times and the 
absorbance was recorded at 660 nm. An extinction coefficient 
Of Geo = 6,600 1 g! m™ was used to calculate the concentration of 
graphene in the solution. 


Cement sample preparation 

FG at various concentrations was dispersed in 1% water-Pluronic (F-127) 
solution. The dispersion was agitated for 15 min at 5,000 rpm using a 
shear mixer (Silverson LSMA). The graphene suspension in water was 
mixed with Portland cement witha water-to-cement ratio of 0.40. The 
slurry was cast in5 x5 x5cm? cubic polytetrafluoroethylene moulds (for 
compressive strength measurements) and in2.5 cm x 3.8 cm cylindrical 
moulds (for tensile strength measurements). All cubes and cylinders 
were taken out the moulds after 24 h and placed in water for curing 
for another 24 h. The compressive and tensile mechanical strengths 
were measured after 7 and 28 days. For each FG-cement ratio, three 
samples were cast and tested. 


Cement and PDMS testing procedures 

Compressive strength. The compression strength tests were per- 
formed using a Forney Variable Frequency Drive automatic machine 
with dual load cells for maximum accuracy. 


Tensile strength. Owing to the brittle nature of cement-based mate- 
rials, the tensile strength was calculated via a splitting test because it 
gives the most accurate measurement. Special jigs held the cylinders 
so that the uniaxial compressive force applied to the centre lines of the 
bottom and top surfaces of the samples caused tensile stress between 
the points of contact. 


Data availability 


The datasets generated and/or analysed during the current study are 
available from the corresponding author on reasonable request. 
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The uranyl ion (UO,”*; U(v1) oxidation state) is the most common form of uranium 


found in terrestrial and aquatic environments and is acentral component in nuclear 
fuel processing and waste remediation efforts. Uranyl capture from either seawater or 
nuclear waste has been well studied and typically relies on extremely strong 
chelating/binding affinities to UO,” using chelating polymers’”, porous inorganic*> 
or carbon-based°’ materials, as well as homogeneous® compounds. By contrast, the 
controlled release of uranyl after capture is less established and can be difficult, 
expensive or destructive to the initial material”. Here we show how harnessing the 
redox-switchable chelating and donating properties of an ortho-substituted closo- 
carborane (1,2-(Ph,PO),-1,2-C,B,oH,,) cluster molecule can lead to the controlled 
chemical or electrochemical capture and release of UO,” in monophasic (organic) or 
biphasic (organic/aqueous) model solvent systems. This is achieved by taking 
advantage of the increase in the ligand bite angle when the closo-carborane is reduced 
to the nido-carborane, resulting in C-C bond rupture and cage opening. The use of 
electrochemical methods for uranyl capture and release may complement existing 
sorbent and processing systems. 


Known for over 50 years, carboranes have been extensively studied in 
coordination chemistry (including with U), catalysis, luminescence, 
and energy storage applications” ». Studies have shown that reduction 
of substituted closo-carboranes to the nido-carboranes results in rup- 
ture of the C-C bond and cage opening, with a simultaneous increase 
in ligand bite angle, 6 (Fig. 1a; closo and nido refer to 2n + 2 and 2n+ 4 
framework bonding electrons, respectively, where nis the number of 
vertices)""*°"8, We rationalized that by incorporating donating groups 
to ortho-carborane, we could tune the chelating properties of the clus- 
ter switching from opened to closed conformations by redox control of 
the reduced and oxidized states, respectively, and enable the chemical 
or electrochemical capture and release of uranyl in solution (Fig. 1a). 
The closo-carborane 1,2-(Ph,PO),-1,2-C,B, oH, (1) was synthesized 
and fully characterized, including by X-ray diffraction (XRD) 
studies (Fig. la, Extended Data Fig. 1a)’. The cage C-C bond length 
(1.688(4) A) and interatomic P---P distance (3.537 A) are consistent 
with previous relevant reports” (all uncertainties are estimated 
standard deviations or standard uncertainties). These metrics 
will be used throughout to correlate coordinated and uncoordinated 
carboranes, both in lieu of—yet proportional to—the traditional 
bite angle 6 (Fig. 1a). The cyclic voltammogram of lin tetrahydrofuran 
(THF) revealed two quasi-reversible cathodic waves at —0.93 V 
and -1.11 V relative to the ferrocene/ferrocenium (Fc/Fc’) redox couple 
(Extended Data Fig. 2a). Reduction of 1 using 2.0 equiv. decamethy]- 
cobaltocene (CoCp;) afforded the direduced nido-carborane, 
[CoCp3],[(nido-1, 2-(Ph,PO)>-1, 2 —C,BioHjo)] (2a) (Fig. la). The 
solid-state structure revealed an open-cage nido-carborane witha 


cleaved C-C bond (2.860 A) and an elongated P---P distance (5.036 A) 
relative to 1 (Extended Data Figs. 1b, c). An analogous salt, [Bu,N],[(nido- 
1,2-(Ph,PO),-1,2-C,B,.H,,)] (2b) (Fig. 1a), relevant to the electrochemi- 
cal experiments, was also synthesized by reduction of 1 with KC, 
followed by salt metathesis with [Bu,N][Cl] (see Methods). 

We next investigated the coordination chemistry of 1 and 2a. Addi- 
tion of 4 equiv. 1to dimeric [UO,CI,(THF),], in deuterated dichlorometh- 
ane (DCM-d,) resulted in a light-yellow solution from which two new 
equivalent-intensity “P resonances appeared at 38.8 and 38.4 ppmin 
the nuclear magnetic resonance (NMR) spectrum, shifted downfield 
from1 (22.8 ppm). The inequivalent P environments suggest either an 
octahedral geometry at U with two monodentate 1 ligands or a pen- 
tagonal bipyramidal geometry at U with two bidentate land achloride 
inthe fifth equatorial site. Although attempts to obtain single crystals 
for XRD studies failed, the NMR data suggest that a 2:1 adduct is formed 
with a presumed formulation of UO,CI,(1),. In contrast to 1, treatment 
of 2 equiv. 2ato [UO,CI,(THF),], led to clean formation ofa single new 
resonance at 51.1 ppminthe»P NMR spectrum, consistent with a biden- 
tate coordination mode. XRD studies confirmed the composition as 
the uranyl salt [CoCp;],[UO,Cl,(nido-1, 2-(Ph,PO),-1, 2 -C,ByoHio)] 
(3) (Fig. 1a, c). A disubstituted uranyl salt was also obtained by 
addition of 4 equiv. 2a to [UO,CI,(THF),],. Monitoring the reaction by 
3!P NMR spectroscopy revealed clean conversion to a new product 
with a single peak at 52.0 ppm. XRD studies on single crystals 
confirmed the composition as the disubstituted complex 
[CoCp3],[UO,(nido-1, 2-(Ph,PO),-1, 2 -C,ByoHjo)2]_ (4) (Fig. 1a, b). 
The bond metrics for 3 and 4 are similar (Extended Data Fig. 1c). 
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Fig. 1| Chemical or electrochemical capture and release of UO,”* with 
resulting complexes shown. a, General chemical or electrochemical mono- or 
bi-phasic capture of uranyl from UO,X,L, (X=Cl, OAc ; L= THF, Ph;PO) using 
the reduced ‘open’-cage nido-carboranes (2a/2b) generated by reduction (for 
example, CoCp; or negative bias) of the ‘closed’-cage closo-carborane (1). The 
corresponding relative bite angles (0) are also shown. Oxidation (for example, 
[FeCp,][PF,] or positive bias) of the captured products 3/4 or 3N/4N leads to 
UO,” release. Compounds labelled in green have been chemically isolated, 
whereas compounds in orange are proposed electrochemical products 

(see Methods). Blue and red pathways represent UO,” capture and release, 
respectively. b, c, Solid-state molecular structures of 4 (b) and 3 (c) obtained 
from XRD studies. H atoms, [Cocp; ] counter cations, phenyl C-H linkages and 
all co-crystallized solvent molecules are omitted for clarity. See Extended Data 
Fig. 1 forthe structures of land 2a. 


Whereas 1 and 2a coordinate to UO,”, we postulated that a third 
ligand with a competitive binding affinity to 1, but weaker than 2a, could 
enablea pathway to UO,” release. Competition experiments using 1, 2a 
and triphenylphosphine oxide (TPO), as part of UO,CI,(TPO), (ref. ”), 
were performed and monitored by™P{'H} NMR spectroscopy in DCM-d,. 
Two equivalents of 1 were added to UO,CI,(TPO), for a1:1 molar ratio of 
1:TPO. After an equilibration period, the *P{{H} NMR spectrum revealed 
broadened resonances for UO,CI,(TPO), and free TPO, as well as a set 
of sharp resonances for UO,CI,(1), and free 1. The ratio of UO,CI,(1),:1 
was determined to be approximately 1:3, suggesting an equilibrium 
favouring the adduct, UO,CI,(TPO), (Extended Data Fig. 3a). The bind- 
ing affinity of TPO was next compared to 2a by addition of 1 equiv. 2a 
to UO,CI,(TPO),. Rapid precipitation of products was observed. The 
31P'H} NMR spectrum of the DCM supernatant revealed complete con- 
version to the products 3 and 4, along witha sharp singlet for TPO, anda 
minor unknown singlet at 47 ppm. Analysis of the precipitate dissolved 
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in propylene carbonate (PC) by “P{'H} NMR spectroscopy revealed 
the presence of 4 (Extended Data Fig. 3b). These data are consistent 
with full dissociation of TPO from UO,CI,(TPO), in the presence of 2a. 
The binding affinity of TPO was next tested against PC, acoordinating 
solvent”. An initial *P{H} NMR spectrum of UO,CI,(TPO), dissolved 
in DCM-d, revealed two singlets in a 3:1 ratio at 48.09 and 47.97 ppm, 
respectively, probably arising from trans:cis isomerism”. Whereas 
addition of 2 equiv. PC led to negligible changes, addition of 20 and 40 
equivalents led to increasing broadness of the aromatic peaks in the 
'H NMR spectra and broadening of the singlets in the *P{'H} NMR spec- 
tra (Extended Data Fig. 4). Together, these data suggest a weak equi- 
librium with PC that is heavily shifted towards UO,CI,(TPO),. Density 
functional theory (DFT) calculations further supported these obser- 
vations. The electron density surfaces with integrated electrostatic 
potentials for land 2a clearly indicate increased electron density at the 
P=O bonds of 2a upon reduction, accounting for its experimentally 
observed increased Lewis basicity (Extended Data Fig. 1d). This prob- 
ably complements the increased bite angle (Fig. 1a) in rendering 2aa 
stronger chelating agent than 1. In silico isodesmic reactions of 2a/TPO, 
TPO/1 or 1/PC with protons as a model for the uranyl cation were also 
calculated (see Methods and Supplementary Information). Together 
with the experimental data, these data support a relative Lewis basicity 
trend of:2a>TPO>1> PC. 

We next investigated the in situ chemical capture and release of UO,”". 
For optimal solubility, we used a 3:1 PC:benzene solvent mixture. A 2:4 
solution of 1:TPO was analysed by “P{'H} NMR spectroscopy, which 
revealed two sharp resonances (Extended Data Fig. 5a). Addition of 
0.5 equiv. [UO,CI,(THF),], (1 equiv. U) resulted in no appreciable change 
tothe resonance for 1, but in substantial broadening to the resonance 
for TPO (Extended Data Fig. 5b). Only trace UO,CI,(TPO), is observed 
and is probably due to a combination of rapid exchange with excess 
TPO and the excess (~2,000 times) PC used relative to U. To initiate 
chemical capture of UO,”, 4 equiv. CoCp;was added. Analysis by *P{'H} 
NMR spectroscopy revealed the rapid and complete conversion of 
1to 4 with concomitant release of TPO (Extended Data Fig. 5c). To ini- 
tiate UO,” release, we first determined the oxidation potential of 4 by 
cyclic voltammetry, which revealed a quasi-reversible anodic event at 
-0.42 V relative to Fc/Fc* (Extended Data Fig. 2b). Thus, we exposed 
our in situ generated solution of 4and TPO to 4 equiv. [Fc][PF,]. Anal- 
ysis by “P{1H} NMR spectroscopy revealed the full conversion of 4 back 
to 1, along with the re-appearance of a broadened TPO resonance, 
similar to that of the pre-reduced solution (Extended Data Fig. 5d). 
Together, these results demonstrate the successful chemical capture 
and release of UO,” in solution. 

We next targeted the electrochemical capture and release of UO,”* 
by galvanostatic bulk electrolysis (GBE). This was conducted using a 
divided H-cell with coiled Pt electrodes, an anion-exchange membrane 
(AEM) and an excess of the Fc/Fc* redox couple in the counter com- 
partment (Fig. 2a). A 0.5:5:6 ratio of [UO,CI,(THF),],:1:TPO was used in 
PC:benzene (3:1) along with an internal standard for “P NMR integra- 
tion. Whereas the use of excess TPO is well reasoned (see above), the 
use of excess 1 was found to be necessary for optimal electrochemical 
performance (see Methods). An initial “P{’H} NMR spectrum revealed 
a sharp signal for 1 and a broadened signal for TPO (Fig. 2b, initial), 
analogous to the chemical capture/release experiments. Electrochemi- 
cal capture of UO,” was initiated by galvanostatically charging the 
solution toa 75% theoretical state of charge (SOC) relative to the UO,”* 
concentration (Fig. 2c, blue; see Methods). Analysis of the reaction 
mixture by “P{‘H} NMR spectroscopy revealed the conversion of 1 to 
the captured products 3N (X = Cl) and 4N-the analogues of 3 and 4, but 
with [Bu,N]* cations (Fig. 1a)—with release of all TPO, as determined by 
integration versus the internal standard (Fig. 2b, cycle 1, blue; Extended 
Data Fig. 6a). To initiate the electrochemical release of UO,”’, the cell 
was galvanostatically discharged to a final SOC of 15% (Fig. 2c, red; 
the SOC extrema of 0% and 100% were not used, to avoid unwanted 
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Fig. 2| Electrochemical setup and quantification data for the capture (blue) 
and release (red) of UO,” in solution. a, Illustration of the H-cell used, 
incorporating excess Fc/Fc’ (left) and1, TPO and [UO,Cl,(THF),], (right) ina 
3:1PC:benzene solvent mixture. Charging the cell (blue) leads to the capture of 
UO,”, converting 1 to 4N (major product) and 3N (minor product, not shown). 
b, Quantification of products and reactants by “P{1H} NMR spectroscopy 
against an inert internal standard, [Ph,PNPPh,][PF,] (not shown). The initial 
spectrum is shown in grey, whereas spectra acquired during charge and 


secondary electrochemical processes”). Analysis of the reaction mix- 
ture by “P{‘H} NMR spectroscopy revealed the full conversion of 3N and 
4N back to 1, along with the presence of a broadened TPO resonance 
(Fig. 2b, cycle 1, red). The capture and release by GBE was carried out 
over the course of another five full cycles (Fig. 2c), with analyses of the 
reaction mixtures by “P{’H} NMR spectroscopy after each run (Fig. 2b). 
We observed that repeated cycling resulted in a loss of electrochemi- 
cally generated 3N, which we attribute to chloride migration to the 
counter compartment over time. The gradual appearance of a minor 
unknown product witha”P resonance at 45 ppm wasalso observed after 
each charge cycle (Extended Data Fig. 6a). Analysis of the *P{{‘H} NMR 
integrations revealed approximate average losses of 0.3% per cycle for 
TPO, 3.4% per cycle for 4N and 7.2% per cycle for 1, perhaps attributable 
to electrochemical side reactions (Extended Data Fig. 6b)*°”*. Lastly, 
analysis of the measured instrumental charge transferred relative to 
the total charge transferred for UO,”* capture (determined by “P{'H} 
NMR integrations) revealed a plateauing trend with increasing cycle 
number, with differences in charge attributed to Faradaic losses (Fig. 2c, 
top). Together, these results demonstrate the successful monophasic 
electrochemical capture and release of UO,”*. 

A biphasic extraction scheme involving dissolved UO,”* (from 
UO,(NO,),.(THF),) in the aqueous phase and 1 in the organic phase 
was next explored as a model system (Fig. 3a)”’. We switched solvents 
from PC to water-immiscible 1,2-dichloroethane (DCE) and modified 
our H-cell design to include a physical glass-frit separator coupled 
with a heterogeneous carbon additive acting as a capacitive buffer, 
analogous toa previous report”’, owing to the incompatibility of DCE 
with the anion-exchange membrane. The capture and release of UO,” 
was simultaneously monitored by “P{’H} NMR and ultraviolet-visible 
absorption (UV-Vis) spectroscopy for the organic and aqueous layer, 
respectively. We note that the vibronic ligand-to-metal charge transfer 
absorption of UO,” (425 nm) is pH-dependent, resulting ina variable 
extinction coefficient (¢)”*”°; therefore, a buffered solution of UO,”* 
was used. Figure 3 outlines the simplified experimental setup display- 
ing half of the H-cell (see Methods and Extended Data Figs. 7, 8 for the 
full cell design and methodology). A DCE solution of 1 (1.0 equiv.) 
with [Bu,N][PF,] as the supporting electrolyte was galvanostatically 
charged to a -75% theoretical SOC. Analysis of the solution by “P{'H} 
NMR spectroscopy revealed the clean conversion of most of 1to 2b 
(Fig. 3a, Extended Data Fig. 9a). A sodium acetate (NaOAc)-buffered 
(pH 5.4) water solution containing 1.25 equiv. UO,(NO,),(THF), was 


discharge cycles (1-6) are shown in blue and red, respectively. c, Bottom, 
applied galvanostatic potentials for charge (blue) and discharge (red) cycles. 
Dashed lines represent wait periods, which were necessary for “P{1H} NMR data 
acquisition. Each cycle is 24 h. Top, instrumental measure of delivered charge 
(teal) versus charge used for the reduction of 1, measured by quantifying the 
total reduced products, 3N and 4N, by =P NMR spectroscopy. See Methods and 
Extended Data Figs. 6, 8 for additional experimental details and data. 


next added to the DCE layer with mixing. Approximately 0.9 equiv. 
UO,” was captured from the aqueous phase, as evidenced by the 
comparison of the UV-Vis spectra obtained before and after mixing 
with the charged DCE solution (Fig. 3a, b, Extended Data Fig. 10a). 
Analysis of the DCE solution by “P{‘H} NMR spectroscopy revealed the 
clean formation of a single resonance at 51.5 ppm (Fig. 3b, Extended 
Data Fig. 9b). Given the similar chemical shifts of isolated complexes 
3 (51.1 ppm) and 4 (52.0 ppm), as well as the quantity of UO,”* captured 
(0.9 equiv.), we propose that the uranyl is probably the mono-ligated 
3N (X = OAc because of the buffer); however, 4N cannot definitively 
be excluded (Fig. 1a). The aqueous phase was next removed and the 
cell was galvanostatically discharged to achieve a theoretical final 
SOC of -0%. Mixing a fresh NaOAc-buffered solution (pH 5.4) into this 
solution led to the release of approximately 0.5 equiv. UO,”* from the 
DCE layer, as confirmed by UV-Vis spectroscopy (Fig. 3c, Extended 
Data Fig. 10b). Analysis of the DCE layer by “P{‘H} NMR spectroscopy 
revealed the near-quantitative conversion to I, as well as the forma- 
tion of minor (-20%) unknown byproducts (Fig. 3c, Extended Data 
Fig. 9c). We propose that the acetate ions probably act as the biphasic 
analogue of the monophasic TPO ligands by competitively binding 
with 1 to UO,”*. Control experiments revealed that negligible biphasic 
capture of UO,”* from the NaOAc-buffered solution occurred in the 
presence or absence of 1 (see Methods and Extended Data Fig. 10c-f). 
Together, these biphasic GBE experiments demonstrate the potential 
applicability of this redox-switchable capture and release chemistry. 

Insummary, we have introduced an approach to uranyl management 
involving its capture and—importantly—its release by controlled redox- 
switchable chelation using a derivatized ortho-carborane in mono- 
phasic or biphasic (organic/aqueous) environments. We anticipate 
that this fundamentally new direction in cluster carborane chemistry 
will have a considerable impact on nuclear fuel extraction and waste 
sequestration activities, and may lead to new research directions in 
related metal capture and release activities. 
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Fig. 3 | Simplified depiction of half H-cell and spectroscopic measurements 
for the biphasic electrochemical capture/release of dissolved UO,” (yellow 
sphere) from/to buffered aqueous solutions. See Methods and Extended 
Data Fig. 7 for an expanded stepwise figure and all experimental details. 

a, Biphasic mixture of UO,X; dissolved ina NaOAc-buffered aqueous solution 
(pH5.4) and of electrochemically generated 2b from1(X =OAc or NO, ). Inset, 
aqueous UV-Vis and organic P{'H} NMR spectra after reduction of 1 to 2b, but 
before phase mixing. Residual lis observed in the latter owing to the set SOC. 
b, Simplified depiction of the captured UO,X, in the form of 3N and/or 4N. 
Inset, aqueous UV-Vis spectrum showing the capture of UO,X, by the 2b/DCE 
layer (top); the corresponding “P{'H} NMR spectrum of the DCE layer showing 
the captured major product (3N/4N) and minor residual 1 (bottom). c, Biphasic 
release of UO,X, from the DCE layer to a fresh NaOAc-buffered solution 
(pH5.4), following electrochemical oxidation of 3N/4N. Inset, aqueous 

UV-Vis and organic “P{1H} NMR spectra of free UO,X, and 1, respectively—both 
consistent with the release of captured UO,X, from the DCE to the aqueous 
phase. A small amount (-20%) of unknown byproducts (marked by asterisks) is 
also observed in the »P{H} NMR spectrum. 
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Methods 


General considerations 

All manipulations were performed under an atmosphere of dry, oxygen- 
free N, or Ar by means of standard Schlenk or glovebox techniques 
(MBraun equipped with a —38 °C freezer, or VAC gloveboxes). Hexanes, 
pentane, DCM and benzene were dried onan MBraun solvent purifica- 
tion system. Acetonitrile (-H, and -D,) was dried over CaH, for several 
days before distillation. THF was dried over sodium benzophenone 
and distilled. PC was degassed by freeze-pump-thaw cycles and stored 
onactivated 4-A molecular sieves before use. DCE was initially distilled, 
followed by drying over CaH; for several days before a second distilla- 
tion and subsequent storage on activated 4-A molecular sieves. [FeCp,] 
[PF], [Bu,N][Cl] and TPO were purchased from Fisher Scientific; tri- 
mesitylphosphine (Mes,P) was purchased from VWR International; 
nBuLi (1.6 M in hexanes) was purchased from Sigma Aldrich; all were 
used without further purification. Ortho-carborane was purchased 
from Boron Specialties and sublimed before use. Ph,PCI was purchased 
from Sigma Aldrich and vacuum-distilled before use. CoCp; was pur- 
chased from Sigma Aldrich and purified by filtration through celite 
using pentane, followed by recrystallization from pentane at —38 °C 
over several days. [Bu,N][PF,] was purchased from Oakwood Chemicals 
and purified by twice recrystallizing from hot ethanol. The recrystal- 
lized product was then washed with cold water, cold ethanol and pen- 
tane before drying at 100 °C under vacuum for 24 h. Sodium acetate 
(NaOAc) buffer was prepared from a stock solution purchased from 
Sigma Aldrich (pH 4.9) and adjusted to pH 5.4 using NaOH. The pH 
value was confirmed using a pH meter. Ketjenblack EC-600JD was pur- 
chased froma private supplier. UO,CI,(TPO), (ref. ”), [UO,CI,(THF).], 
(ref. *"), [UO,(NO3),(THF)5] (ref. **), KC, (ref. °°) and bis(triphenyl-phos- 
phoranylidene)ammonium hexafluorophosphate ([Ph3,PNPPhs][PF.]) 
(ref. **) were prepared by procedures reported in the literature. 


NMR. NMR spectra were obtained using a Varian Unity Inova S00 MHz 
or Agilent Technologies 400 MHz spectrometer, and were referenced to 
residual solvent resonances of acetonitrile (MeCN-d;) or dichlorometh- 
ane (DCM-d,) or externally (“B, 85% (Et,O)BF,; »P, 85% H;PO,). Chemical 
shifts (6) were recorded in parts per million. All“B NMR spectra were 
processed using MestReNova software to reduce a background signal 
with a linewidth of approximately 3,000 Hz from the Pyrex NMR tubes. 
The NMR time-domain data were first left-shifted to discard the first 
~0.1ms. To correct the linear phase change, alinear prediction was used 
to fill the initial discarded data before the Fourier transform or an ap- 
propriate linear phase correction was applied to the frequency-domain 
data after the Fourier transform. The relaxation times 7, for “P nuclei 
were determined using the inversion recovery method. The delay times 
after the 180° inversion pulse were varied up to the maximum of five 
times of the expected 7, values. The signal recovery curve was fitted 
with an exponential function to extract the 7, values. Subsequent one- 
dimensional spectra were acquired with five times the longest 7, value 
measured for accurate integrations (where T, values were determined 
for TPO: 2.3 s; Mes,P: 5.1s; [Ph,PNPPh,][PF,]: 7.15; 1: 0.51s; 3: 0.64 s; 
and 4: 0.57s). 


UV-Vis. UV-Vis absorption spectra were collected using a Shimadzu 
UV-2401PC spectrophotometer. The UO,” extinction coefficient (¢) 
was experimentally determined to be 7.715 litres mol cm (460 nm) 
at pH 5.4. 


Elemental analyses. Elemental analyses (C, N, H) were performed at 
the University of California, Berkeley using a Perkin Elmer 2400 Series 
Il combustion analyser. 


Cyclic voltammetry. Cyclic voltammetry was performed using a CH 
Instruments Electrochemical Analysis potentiostat, equipped with a 


3-mm-diameter glassy carbon working electrode, a Ag-wire pseudo- 
reference electrode and a Pt-wire counter electrode using a [Bu,N][PF,] 
(0.1M) solution as the supporting electrolyte. Cyclic voltammograms 
were referenced to the Fc/Fc* redox couple. 


Galvanostatic bulk electrolysis. Galvanostatic bulk electrolysis 
cycling experiments were carried out inside an Ar glovebox using a 
Metrohm Autolab PGSTAT128N potentiostat/galvanostat. The full ex- 
perimental setups for both the mono- and biphasic cycling experiments 
are described in sections ‘Monophasic electrochemical capture and 
release of UO,”” and ‘Biphasic electrochemical capture and release 
of UO,2”, 


X-ray crystallography. X-ray crystallography data were collected on 
a Bruker KAPPA APEX II diffractometer equipped with an APEX II CCD 
detector using a TRIUMPH monochromator with a Mo Ka X-ray source 
(wavelength a= 0.71073 A). The crystals were mounted on acryoloop 
under Paratone-N oil, and all data were collected at 100(2) K using an 
Oxford nitrogen-gas cryostream system. A hemisphere of data was 
collected using w scans with 0.5° frame widths (w, angle between the 
X-ray source and the sample). Data collection and cell-parameter deter- 
mination were conducted using the SMART program. Integration of the 
data frames and refinement of the final cell parameters were performed 
using SAINT software. Absorption correction of the data was carried 
out using SADABS. Structure determination was done using direct or 
Patterson methods and difference Fourier techniques. All hydrogen 
atom positions were idealized and rode on the atom of attachment. 
Structure solution, refinement, graphics and creation of publication 
materials were performed using SHELXTL or OLEX’. 


Synthesis of closo-(1,2-(Ph,PO),-1,2-C,B,oH;9) (1). The synthesis of 
compound 1 was accomplished in two steps by modifications to lit- 
erature procedures”, 

Step 1. A solution of nBuLi in hexane (1.6 M, 28.2 ml, 45 mmol) was 
added dropwise to a solution of ortho-carborane (3.1g, 21.5 mmol) in 
dry diethyl ether (250 ml) at -78 °C, resulting in the formation ofa fine 
colourless precipitate. The reaction was slowly warmed to room tem- 
perature. After stirring for 30 min at room temperature, the mixture 
was cooled to 0 °C, and Ph,PCI (7.7 ml, 41.7 mmol) was added dropwise, 
resulting in a pale orange solution with a colourless precipitate. The 
mixture was stirred for 30 min at 0 °C. The solution was brought to room 
temperature and stirred for 30 min and was subsequently warmed to 
reflux and stirred for an additional 30 min. The solution was cooled to 
0 °C, and water (30 ml) was slowly added to the mixture. The mixture 
was allowed to stir for 20 min and was filtered over a glass frit, where 
the resulting solid was washed with additional water (30 ml) and diethyl 
ether (20 ml). The solid was dried under vacuum at 100 °C for 2h. The 
product was recrystallized from a mixture of hexane/toluene and was 
obtained in 81% yield (8.6 g, 16.7 mmol) 'H NMR (400 MHz, CDCI,): 
67.06-7.47 (m, 20H); 0.98-2.66 (broad, 10H)."B NMR (400 MHz, CDCI,): 
6 -0.40, —9.42. "B{/H} NMR (400 MHz, CDCI,): 6 -0.40, -7.21, -9.42. 
31P {1H} NMR (400 MHz, CDCI,): 67.88. 

Step 2. Asolution of H,O, (30% in water, 1.8 ml, 58.7 mmol) was added 
dropwise toa solution of [closo-(1,2-(Ph,P),-1,2-C,B,9H,,)] (from step 1) 
(2.1g, 4.1 mmol) in THF (50 ml). The reaction was stirred for 3h at room 
temperature. The reaction was monitored by 2P NMR for formation of 
an unwanted side-product at 49 ppm. Once this product formed, the 
reaction was discontinued by addition of chloroform. The mixture 
was washed with water and brine, the phases were separated and the 
organic layer was dried with Na,SO,. The solvent was removed, and the 
solid was slowly recrystallized from acetonitrile to yield a colourless 
crystalline solid. The solid was dried under vacuum at 80 °C for several 
hours (1.4 g, 2.6 mmol, 60% yield). Single crystals suitable for X-ray 
crystallography were obtained by vapour diffusion of pentane ina 
saturated THF solution of 1.'H NMR (400 MHz, MeCN-d;,):67.99 (m, 8H), 
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7.63 (m, 4H), 7.54 (m, 8H), 2.5 (broad s, 10H). 4B and "B{{H} NMR 
(400 MHz, MeCN-d;): 6 0.66, -8.75. =P{{H} NMR (400 MHz, MeCN-d;): 
6 22.8. Anal. calc. for C,,H39B,>0,P,: C, 57.35; H, 5.55. Found: C, 57.33; 
H, 5.66. Selected interatomic distances for 1 were C---C: 1.688(4) A, 
P---P: 3.537 A. 


Synthesis of UO,CL,(1),. A 20-ml vial equipped with a magnetic stirbar 
was charged with 1 (54.4 mg, 0.1 mmol) and 2 ml of DCM. Ina separate 
vial, [UO,CL,(THF),], (24.2 mg, 0.025 mmol) was dissolved in 2 ml of 
DCM and added tothe stirring solution of 1, resulting inthe formation 
of a light-yellow suspension. After 1h, the reaction mixture became 
homogeneous. This was stirred for an additional 24 h at room tem- 
perature. The solvent was removed, yielding a yellow solid (62.5 mg). 
Multiple attempts were made to obtain single crystals suitable for 
XRD studies, but failed. 'H NMR (400 MHz, DCM-d,): 6 8.32-6.29 (m, 
20H), 2.5 (broads, 10H)."Band “B{’H} NMR (400 MHz, DCM-d,): 65.84, 
-8.46. PH} NMR (400 MHz, DCM-d,): 638.8 (s), 38.4 (s). Anal. calc. for 
CH ¢oBooCl,O.P,U:1/2CH,Cl,: C, 43.05; H, 4.16. Found: C, 42.76;H, 4.32. 


Synthesis of [CoCp3],[UO,Cl,(nido—1, 2-(Ph,PO),—1, 2—C,B)Hjy)] 
(2a). A20-ml vial equipped with a magnetic stirbar was charged with1 
(54.4 mg, 0.1 mmol) and 6 ml of benzene. Ina separate vial, CoCp; 
(69.1 mg, 0.2 mmol, 2.0 equiv.) was dissolved in 4 ml of benzene and 
added dropwise to the stirring solution of 1. Upon addition, a yellow 
solid immediately precipitated from the reaction mixture, and the 
mixture was stirred for an additional 4 h at room temperature. Stirring 
was discontinued and the solid was allowed to settle to the bottom of 
the vial. The supernatant was decanted and filtered ona plug of celite. 
The solids were washed with benzene (3 x 6 ml) and each washing was 
filtered over the same celite plug. The remaining solids were then dis- 
solved in a minimal amount of MeCN (2 ml) and filtered on the same 
celite plug into a new vial. The MeCN filtrate was collected and the 
volatiles were removed in vacuo, yielding a shiny golden-yellow solid 
(114.3 mg, 0.95 mmol, 95% yield). Single crystals suitable for X-ray crys- 
tallography were obtained by vapour diffusion of Et,O ina saturated 
MeCN solution of 2a at -38 °C. 'H NMR (400 MHz, MeCN-d;): 6 8.05 
(broad m, 8H), 7.24 (broad s, 12H), 1.62 (s, 60H). We note that carborane 
B-H resonances were too broad to be observed. "B and "B{‘H} NMR 
(400 MHZ, MeCN-d;): 6 20.24, -0.62, -18.46, -22.06. *P{'H} NMR 
(400 MHz, MeCN-d,): 629.8. Anal. calc. for C,,H99B,>Co,0,P,: C, 65.88; 
H, 7.54. Found: C, 65.57; H, 7.67. Selected interatomic distances for 2a 
were C--C:2.860 A, P---P: 5.036 A. 


Synthesis of [Bu,N],[(nido-1,2-(Ph,PO),-1,2-C,B,.H;9)] (2b). The syn- 
thesis of compound 2b was accomplished in two steps. 

Step 1.Inthe glovebox, a250-ml round-bottom flask equipped witha 
magnetic stirbar was charged with 1 (272.2 mg, 0.5 mmol) and 20 ml of 
THF, and cooled to -78 °C. Ina separate vial, KC, (182.4 mg, 1.35 mmol, 
2.7 equiv.) was suspended in 20 ml of THF and added slowly dropwise 
to the stirring solution of 1. Upon addition, the KC, suspension began 
to turn grey, and was stirred for 30 min at room temperature. Stirring 
was discontinued and the mixture was filtered over a pad of celite on 
a fine glass frit. The graphite pad was washed additionally with MeCN 
(3x5 ml). The filtrate was collected and the volatiles were removed in 
vacuo, yielding a pale-yellow oil. The oil was redissolved in THF (5 ml) 
and layered with5 ml of pentane, and was recrystallized at —38 °C, yield- 
ing a white solid (234.1 mg, 0.38 mmol, 75% yield). 'H NMR (400 MHz, 
MeCN-d,): 6 7.81 (broad m, 8H), 7.28 (broad m, 12H). Carborane B-H 
resonances were too broad to be observed. “B NMR (400 MHz, MeCN- 
d;): 620.24, -0.62, -18.46, -22.06. *P{‘H} NMR (400 MHz, MeCN-d;): 
631.8. Anal. calc. for C,,H39B,9K,0,P>:THF: C, 51.86; H, 5.51. Found: C, 
51.09; H, 5.47. 

Step 2. A 20-ml vial equipped with a magnetic stirbar was charged 
with [K],[(nido-1,2-(Ph,PO),-1,2-C,BioH,o)" THF] (655.4 mg, 0.94 mmol) 
(Step 1) and 15 ml of DCM, and was cooled to —-78 °C. Ina separate vial, 


[Bu,N][Cl] (528 mg, 1.88 mmol, 2.0 equiv.) was dissolved in 10 ml of 
DCM and added dropwise to the stirring carborane suspension. Upon 
addition, the reaction mixture became homogenous, and after lha 
white precipitate began to settle. This mixture was stirred for 2 h at 
room temperature. Stirring was discontinued and all volatiles were 
removed, yielding a pale-yellow solid. A minimal amount of DCM (3 ml) 
was added and the mixture was filtered ona celite plug. The solids were 
washed with chilled DCM (3 x 2 ml) and each washing was filtered over 
the same celite plug. The DCM filtrate was collected and the volatiles 
were removed in vacuo, yielding a pale-yellow residue; this was tritu- 
rated with pentane (10 ml) and dried in vacuo to yield an off-white solid, 
which was recrystallized from DCM and pentane (690.4 mg, 0.67 mmol, 
71% yield). 'H NMR (400 MHz, MeCN-d;): 6 8.03 (m, 8H), 7.25 (broad s, 
12H), 3.08 (m, 16H), 1.59 (m, 16H), 1.35 (m, 16H), 0.96 (t, 24H). Carbo- 
rane B-H resonances were too broad to be observed. "B and “B{'H} 
NMR (400 MHz, MeCN-d,): 6 20.25, -0.65, -18.42, -22.05. *P{{H} NMR 
(400 MHz, MeCN-d,): 6 30.1. Anal. calc. for C;gH,92B,gN,O,P,: C, 67.67; 
H, 9.99. Found: C, 66.81; H, 10.29. 


Synthesis of [CoCp;],[UO,Cl,(nido—1, 2-(Ph,PO),—1, 2—C,B pH o)] 
(3). A 20-ml vial equipped with a magnetic stirbar was charged with 
[UO,CI,(THF),], (16.9 mg, 0.018 mmol) and 4 ml of MeCN. Inaseparate 
vial, 2a (42.2 mg, 0.035 mmol) was dissolved in 4 ml of MeCN and then 
added dropwise to the stirring solution of [UO,CI,(THF),],, turning 
dark yellow. After stirring for 5 min, a yellow solid began to precipitate 
from the reaction mixture, and this was stirred at room temperature 
for 4h. Stirring was discontinued and the mixture was passed over a 
celite plug, collecting a yellow solid and a yellow filtrate. The solvent 
was removed from the yellow filtrate, yielding a yellow solid, which was 
washed with THF (3 x 2 ml) and passed over another celite plug. The 
filtrate was collected and the volatiles were removed in vacuo, yielding 
a yellow powder. Residual [CoCp5][CI] was removed by selectively re- 
crystallizing the mixture from pyridine/Et,O at -38 °C. The supernatant 
was transferred and the volatiles were removed, collecting a yellow 
solid (36.3 mg, 0.024 mmol, 69% yield). Single crystals suitable for XRD 
studies were grown by vapour diffusion of Et,O into a saturated MeCN 
solution of 3 at room temperature. 'H NMR (400 MHz, MeCN-d;): 68.13 
(m, 10H), 7.35 (m, 10H), 1.66 (s, 60H). Carborane B-H resonances were 
too broad to be observed. "B and "B{'H} NMR (400 MHz, MeCN-d,):6 
0.27, -16.90, -19.69. “P{1H} NMR (400 MHz, MeCN-d,): 651.12. Anal. 
calc. for C,gHo 9B, >Cl,Co,0,P,U-2MeCN: C, 51.70; H, 5.95; N, 1.72. Found: 
C, 51.91; H, 5.88; N, 1.74. Selected interatomic distances and angles for 
3 were C---C: 2.855 A; P--P: 4.697 A; O1-U-O2: 86.5°(3°). 


Synthesis of [(CoCp;],[UO,(nido—1, 2-(Ph,PO),—1, 2—C,BjyH jp) 24). 
A 20-ml vial equipped with a magnetic stirbar was charged with 2a 
(26.6 mg, 0.022 mmol) and 1.5 ml of MeCN. In a separate vial, 
[UO,CI,(THF),], (5.3 mg, 0.0055 mmol) was dissolved in1.5 ml of MeCN 
and then added dropwise to the stirring solution of 2a. After stirring 
for 5 min, a yellowsolid began to precipitate from the reaction mixture, 
and this was stirred at room temperature for 24 h. Stirring was discon- 
tinued and the solid was allowed to settle to the bottom of the vial. The 
supernatant was decanted and filtered onacelite plug. The solids were 
washed with MeCN (3 x 2 ml) and each washing was filtered over the 
same celite plug. The remaining solids were then dissolved ina minimal 
amount of pyridine and filtered on the same celite plug into anew vial. 
The pyridine filtrate was collected and the volatiles were removed in 
vacuo, yielding alight-yellow powder (35.5 mg, 0.018 mmol, 80% yield). 
Single crystals suitable for XRD analysis were grown froma small-scale 
reaction in aJ. Young NMR tube, which was charged with a solution of 
2a (12.0 mg, 0.01 mmol) in MeCN-d, (0.25 ml). A solution of 
[UO,CL,(THF),], (2.4 mg, 0.0025 mmol) in MeCN-d, (0.25 ml) was add- 
edto this, whereupon crystals suitable for XRD analysis slowly formed 
on the walls of the NMR tube. 'H NMR (400 MHz, MeCN-d,): 68.13 (m, 
15H), 7.38 (m, 10H), 7.24 (m, 15H), 1.68 (s, 60H). Carborane B-H 


resonances were too broad to be observed. "B and "B{{H} NMR 
(400 MHz, MeCN-d): 6 0.26, -17.15, -20.66. “P{{H} NMR (400 MHz, 
MeCN-d,): 652.0. Anal. calc. for Cy,Hj29B29C0,0,P,U: C, 54.76; H, 5.99. 
Found: C, 55.22; H, 6.36. Selected interatomic distances and angles for 
4 were C---C: 2.857 A; PP: 4.806 A; O1-U-O2: 89.7°(17°). 


Chemical capture and release of UO,”* 

Formation of in situ generated [(UO,)(TPO).ClL,]. A 20-ml vial 
equipped with a magnetic stirbar was charged with 1 (2.0 equiv., 5.4 mg, 
0.01mmol), TPO (4.0 equiv., 5.6 mg, 0.02 mmol) and Mes,P (4.0 equiv., 
7.7 mg, 0.02 mmol) and dissolved in a 3:1 PC:benzene (3 ml) solvent 
system. A 500-pl aliquot was taken from this mixture and placed in 
an NMR tube equipped with a MeCN-d, capillary tube. A P{{H} NMR 
spectrum was collected and the relative integrations were recorded 
(Extended Data Fig. 5a). The NMR solution was returned to the vial 
and [UO,CI,(THF),], (0.5 equiv., 2.4 mg, 0.0025 mmol) was added. 
The mixture was stirred vigorously until all the solids were dissolved 
(-20 min), resulting ina light-yellow coloured solution. After 1h, a500-pl 
aliquot was taken from the reaction mixture and placed in an NMR tube 
equipped with a MeCN-d, capillary tube. A “P{{H} NMR spectrum was 
obtained (Extended Data Fig. 5b). After the spectrum was recorded, the 
NMR sample was transferred back into the reaction mixture. 


Reduction. CoCp;(4.0 equiv., 6.6 mg, 0.02 mmol) in100 pl of benzene 
was added dropwise to the generated solution described above. Upon 
addition, the solution turned golden in colour and was stirred for 1h 
at room temperature. A 500- I aliquot was then taken from the reaction 
mixture and placed in an NMR tube equipped witha MeCN-d, capillary 
tube. A »P{'H} NMR spectrum was obtained (Extended Data Fig. 5c), 
and then the NMR solution was transferred back into the reaction mix- 
ture. 


Oxidation. [Fc][PF,] (4.0 equiv, 6.6 mg, 0.02 mmol) was added to the 
reduced solution described above. Upon addition, the solution turned 
green and thenagoldencolour. The solution was stirred for lhat room 
temperature, after which a 500-l aliquot was taken from the reaction 
mixture and placed in an NMR tube equipped witha MeCN-d, capillary 
tube. A *P{‘H} NMR spectrum was obtained (Extended Data Fig. 5d). 
After the spectrum was obtained, the NMR sample was transferred 
back into the reaction mixture. 


Monophasic electrochemical capture and release of UO," 
Experimental conditions. Galvanostatic bulk electrolysis experiments 
were carried out ina divided glass H-cell (Extended Data Fig. 8a, b). The 
physical barrier between each component of the cell and the respective 
two Bio-Logic high-surface coiled Pt electrodes was an anion-exchange 
membrane (Membranes International, AMI-7001) held in place by two 
fluorinated ethylene propylene-encapsulated silicon o-rings with a 
metal clamp. The electrodes were cleaned by rinsing with distilled water 
and acetone and then heating until white-hot with a butane torch before 
use. The anion-exchange membrane was soaked ina 0.1M [Bu,N][PF,] 
solution of PC/benzene (3:1) over 3-A molecular sieves for 24 h before 
use. The left compartment, containing the counter electrode, consisted 
of Fc (41.9 mg, 0.225 mmol) and [Fc][PF,] (74.5 mg, 0.225 mmol) in 
7.0 ml of a0.1M [Bu,N][PF,] PC:benzene solution. The right compart- 
ment, containing the working electrode, contained 1 (5 equiv., 40.8 mg, 
0.075 mmol), TPO (6 equiv., 25.0 mg, 0.09 mmol), [UO,CI,(THF),], 
(0.5 equiv., 7.3 mg, 0.0075 mmol of dimer (1.0 equiv. of U monomer)) 
and [Ph,PNPPh,][PF,] (1.0 equiv., 10.2 mg, 0.015 mmol) ina0.1M [Bu,N] 
[PF,] PC:benzene solution (7.0 ml). 


Experimental parameters. The cell was charged/discharged over the 
course of six cycles. To initiate UO,”* capture, the first cycle was charged 
with an applied current of -201.0 pA over the course of 6 htoa 75% SOC 
relative to the [UO,CI,(THF),], concentration. After the cell was charged, 


await period of 2h was incorporated between charge/discharge cycles 
(Fig. 2c, grey dashed). UO,”* release was achieved by discharging the 
cell galvanostatically at an applied current of 68.94 JA over the course 
of 13 h, using voltage cutoffs (0.0 V), to a final SOC of approximately 
15% relative to the initial [UO,CI,(THF),], concentration. After each 
cell discharge, a wait period of 4-5 h was incorporated (depending on 
when the voltage cutoffs were applied) between discharging/charging 
cycles. Each additional cycle thereafter was charged and discharged 
galvanostatically at currents of -160.87 LA and 68.94 LA, respectively. 
This resulted in charging cycles of ~15—75% SOC and ~75-15% SOC, re- 
spectively. Between each charge/discharge a “P{’H} NMR spectrum 
was obtained (Extended Data Fig. 6a) using a40-s relaxation delay (see 
NMR details above) with [Ph,PNPPh,][PF,] as the standard. We note 
that an excess of 1 was used to keep the applied current (/,,,) below 
the limiting current at any given time (/,(0) for the presumed electro- 
chemical-chemical mechanism involving reduction of 1 followed by 
uranyl ligation. This allowed the use of a galvanostatic charge/discharge 
procedure operating close to the mass-transfer-controlled plateau 
(similar to potentiostatic methods) but with the added benefit of not 
requiring prior knowledge of the optimal applied voltage, whichis a 
function of both the onset of the reductive process and the total cell 
impedance”. Attempts at GBE with stoichiometric equivalents of 1 
revealed an earlier-than-expected onset of/,,, > /,(¢), clearly indicating 
that additional and unwanted electrochemical processes were being 
accessed and perhaps suggesting an initial degradation of 1 within the 
system. Therefore, an initial ratio of 0.5:6:8 for [VO,CI,(THF),],:1:TPO 
reagents was used. 


Biphasic electrochemical capture and release of UO,”* 
Experimental conditions. A complete, stepwise, half-cell figure of 
the experiments conducted in this section, along with spectroscopic 
data, are shown in Extended Data Fig. 7. Two-electrode galvanostatic 
bulk electrolysis was performed in an argon glovebox using a two- 
compartment H-cell with a glass-frit separator, a stir bar in each com- 
partment, and reticulated vitreous carbon (RVC) foam electrodes for 
both the working and counter electrodes (Extended Data Fig. 8c, d). 
The RVC foam electrodes consisted of a ~5-cm steel rodinserted intoa 
100 PPI Duocel RVC foam core (length ~2.5 cm; diameter ~3 mm), witha 
tap bore (length ~5 mm; diameter ~2 mm), which was filled with molten 
gallium to fuse the steel connector to the RVC foam. Each electrode had 
an end-to-tip resistance of <5 Q. The RVC electrodes were rinsed with 
methanol and dried under reduced pressure overnight before use. The 
Ketjenblack used was dried for 48 hina175 °C oven and ground ina glass 
mortar and pestle under inert atmosphere before use. 


Reduction (charging). The counter compartment consisted of 400 mg 
of Ketjenblack suspended in 6 ml of a0.1M solution of [Bu,N][PF,] in 
DCE. The working compartment consisted of 1 (34 mg, 0.0625 mmol, 
1.0 equiv.) dissolved in 6 ml of a 0.1M solution of [Bu,N][PF,] in DCE. 
Acharging current of -107.1 pA with a—9.25 C charge cutoff was used, 
resulting in a~75% SOC after 24 h assuming 100% Coulombic efficiency. 
Upon completion, the working compartment solution was analysed by 
31P/H} NMR spectroscopy to reveal the formation of 2b (Extended Data 
Fig. 9a). The working compartment solution was then removed from 
the H-cell and placed in a 20-ml vial for subsequent capture chemistry. 


UO,” capture. A 5-ml vial was charged with excess UO,(NO,),(THF), 
(42 mg, 0.078 mmol, 1.25 equiv.) and dissolved in 3 ml of a0.1M so- 
dium acetate buffer adjusted to pH 5.4 (0.026 MUO,”). An aliquot of 
the resulting pale-yellow solution was used to record an initial UV-Vis 
spectrum (Extended Data Fig. 10a, blue). The aliquot was transferred 
back to the 5-ml vial and this solution was added slowly, dropwise and 
without stirring to the DCE solution containing the electrochemically 
reduced 1 (forming 2b). After addition, the mixture was allowed to stir 
for 2h, resulting ina bright-yellow organic phase and a very-pale-yellow 
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aqueous phase. Stirring was discontinued and the organic and aqueous 
phases were separated using a small separatory funnel. An aliquot of the 
aqueous phase was used to record a UV-Vis spectrum (Extended Data 
Fig. 10a, red), indicating that 0.022 mmol of UO,”* remained, which is 
equivalent to the capture of 0.056 mmol (-0.9 equiv.) to the organic 
phase. A 1-ml aliquot was taken from the pale-yellow dichloroethane 
layer and transferred to an NMR tube. An unlocked “P{‘H} NMR spec- 
trum was collected indicating the formation of 3N/4N (Extended Data 
Fig. 9b). The NMR solution was returned to the 20-ml vial. 


Oxidation (discharging). Two-electrode galvanostatic bulk electrolysis 
(discharging) of the captured DCE solution was performed using the 
same cell used for charging. A discharging current of 107.1 L.A was applied 
until 9.49 C of charge was transferred, resulting in a final SOC of ~0% (as- 
suming 100% Coulombic efficiency and no loss of material during the 
biphasic capture). Upon completion, the working compartment solution 
was removed and placed ina20-ml vial for subsequent release chemistry. 


UO,”* release. The 20-ml vial containing the electrochemically oxidized 
3N/4N yellow DCE solution was equipped with a stirbar, and asolution 
of 0.1M sodium acetate buffer adjusted to pH 5.4 (3 ml) was added 
dropwise. The mixture was allowed to stir for 12 h, resulting in a pale- 
yellow aqueous phase and a colourless organic layer. The organic and 
aqueous phases were separated using a small separatory funnel, and an 
aliquot of the aqueous layer was used to take a UV-Vis spectrum indicat- 
ing the presence of released UO,”* (0.031 mmol, -0.5 equiv.) (Extended 
Data Fig. 10b). A1-ml aliquot was taken from the yellow DCE layer and 
transferred to an NMR tube. An unlocked #!P{{H} NMR spectrum was 
collected that indicated the clean formation of landasmall unknown 
byproduct at 20.1 ppm (Extended Data Fig. 9c). 


Biphasic control experiments 

UO,” migration from water to DCE in the absence of carborane 
(1 or 2a/b). A 5-ml vial was charged with UO,(NO,),(THF), (14.0 mg, 
0.026 mmol) and dissolved in 1.5 ml of a0.1M sodium acetate buffer 
adjusted to pH 5.4 (0.017 M UO,”*). An aliquot of the resulting pale- 
yellow solution was used to record an initial UV-Vis spectrum (Extended 
Data Fig. 10c, blue). The aliquot was transferred back to the 5-ml vial. 
Aseparate 20-ml vial equipped witha stirbar was charged with [Bu,N] 
[PF,] (0.2324 g, 0.1M) dissolved in DCE (6.0 ml). Tothe clear DCE solu- 
tion, the pale-yellow aqueous solution was added slowly dropwise over 
the course of 2 min without stirring. After addition, the mixture was al- 
lowed to stir for 4h and the organic phase remained clear. Stirring was 
discontinued and the organic and aqueous phases were separated using 
asmall separatory funnel. Small aliquots of the aqueous (Extended Data 
Fig. 10c, red) and organic (Extended Data Fig. 10d) phases were used to 
record UV-Vis spectra, which together clearly indicated that the UO,”* 
had remained in the aqueous phase. 


UO,” migration from water to DCE in the presence of 1. A 5-ml vial 
was charged with UO,(NO,),(THF), (14.0 mg, 0.026 mmol, 1.0 equiv.) 
dissolved in 1.5 ml of a0.1M sodium acetate buffer adjusted to pH 5.4 
(0.017 M UO,”*). An aliquot of the resulting pale-yellow solution was 
used to record an initial UV-Vis spectrum (Extended Data Fig. 10e, blue). 
The aliquot was transferred back to the 5-ml vial. A separate 20-ml vial 
with a stirbar was charged with 1 (14.1 mg, 0.026 mmol, 1.0 equiv.), 
[Bu,N][PF,] (0.2324 g, 0.1M) and DCE (6.0 ml). To the clear DCE solu- 
tion, the pale-yellow aqueous solution was added slowly dropwise over 
the course of 2 min without stirring. After addition, the mixture was 
allowed to stir for 3 hand the organic phase remained clear. Stirring was 
discontinued and the organic and aqueous phases were separated using 
asmall separatory funnel. Small aliquots of the aqueous (Extended Data 
Fig. 10e, red) and organic (Extended Data Fig. 10f) phases were used 
to record UV-Vis spectra, which together clearly indicated negligible 
transfer of UO,”* from the aqueous to the organic phase. 


DFT studies 

DFT calculations were performed using Gaussian 09.2°°. Geometry 
optimizations for all molecules were performed using the B3LYP/def2- 
SVP*”? level of theory (see Supplementary Information for atom coor- 
dinates) in DCM using the conductor-like polarizable continuum model 
(CPCM) implemented in the Gaussian 09 software” “. Electron density 
surfaces with electrostatic potentials were extracted from optimized 
land 2a (Extended Data Fig. 1d). Thermal energy corrections were 
extracted fromthe results of the frequency analyses performed at the 
same level of theory. Frequency analyses of all molecules and intermedi- 
ates contained no imaginary frequency, showing that these are energy 
minima. The equilibrium constants were calculated from the Gibbs 
free energy values G for the proton-transfer reactions (K.,=e “7”; R, 
molar gas constant; 7, temperature). See Supplementary Information 
for the competition reaction results and atom coordinates. 
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1 2a 3* 4" 
CC 1.688(4) A 2.860 A 2.855 A 2.857 A 
P+<P 3.537 A 5.036 A 4.697 A 4.806 A 
O1-U-O2 86.5(3)° 89.7(17)° 


*Solid-state molecular structures 3 and 4 are shown in Figure1. 


Extended Data Fig. 1| Solid-state molecular structures and electron density for complexes 1, 2a, 3 and 4. d, Electron density surfaces with colour-coded 
surfaces. a,b, Solid-state molecular structures of complexes 1 (a) and 2a (b). electrostatic potentials obtained from DFT calculations using optimized 
Hatoms, [CoCp; counter cations (2a) and all co-crystallized solvent structures of Land the anion of 2a, labelled 2 (negative values (red) are 
molecules are omitted for clarity. c, Selected interatomic distances and angles indicative of higher electron density). 
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Extended Data Fig. 2| Cyclic voltammetry data for 1and 4. a, Cyclic 
voltammogram of 1(0.5mM) ina0.1M [Bu,N][PF,] THF solution, measured 
using a3-mm-diameter glassy carbon working electrode anda platinum-wire 


counter electrode, referenced to the Fc*/Fc redox couple (scanrate, 100 mVs‘’). 


The quasi-reversible redox event exhibits two cathodic waves at —0.93 V and 
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-1.11V. b, Cyclic voltammogram of 4 (1.0 mM) ina0.1M[Bu,N][PF,] PC solution, 
obtained using a3-mm-diameter glassy carbon working electrode anda 
platinum-wire counter electrode, referenced to the Fc*/Fc couple (scan 

rate, 100 mVs‘). The quasi-reversible redox event exhibits one anodic wave at 
-0.42V. 
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Extended Data Fig. 3|“P{'H} NMR spectra fromcompetitionexperimentsof = DCM-d,.Rapid precipitation ofa yellow solid was observed. Top, “P{'H} NMR 
1,2aand TPO. a, “P{'H} NMR spectrum of 2.0 equiv. 1 with 1.0 equiv. spectrum of the DCM-d, supernatant. An unknown byproduct at 47 ppmis 
UO,CI,(TPO), in DCM-d,. Relative integrations are shown in red. b, »P{{H} NMR observed. Bottom, “P{'H} NMR spectrum of the filtrate dissolved in PC witha 
spectra of in situ reactions of 1.0 equiv. 2a with 1.0 equiv. UO,CI,(TPO), in DCM-d, capillary tube. 
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Extended Data Fig. 4 |*P{‘H} NMRspectrafromcompetitionexperimentsof after the addition of 20.0 equiv. PC to the solutionina_c, *P{‘H} NMR spectrum 
PC with UO,CI,(TPO),.a, Initial “P{{H} NMR spectrum of UO,C1,(TPO), in after the addition of 40.0 equiv. PC to the solutionina. 
DCM-d,, displaying the proposed cis:trans isomers. b, *P{{H} NMR spectrum 
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Extended Data Fig. 5 | **P{’H} NMR spectra for chemical capture and release b, »P{'H} NMR spectrum of 4.0 equiv. TPO and 2.0 equiv. Lin the presence of 

of UO,”*. Reactions were carried out ina PC:benzene (3:1) solvent mixturewitha 0.5 equiv. [UO,CI,(THF),],. c, “P{{H} NMR spectrum of 4.0 equiv. CoCp; addedto 
MeCN-d, capillary tube insert using Mes,P as the analytical standard. A the reactioninb.d, *P{'H} NMR spectrum after addition of 4.0 equiv. [Fc][PF,] 
relaxation delay of 30 s was used to obtain accurate integrations of all species tothe reactioninc. The reaction conditions are detailed in Methods. 

(see Methods). a, “P{‘H} NMR spectrum of 4.0 equiv. TPO and 2.0 equiv. 1. 
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Extended Data Fig. 6 |Monophasic electrochemical capture and release of 
UO," by GBE. Reactions were carried out ina PC:benzene (3:1) solvent mixture 
witha MeCN-d, capillary tube insert with [Ph,PNPPh,][PF,] as the analytical 
standard. A relaxation delay of 40s was used to obtain accurate integrations of 
all species (see Methods). a, A: “P{‘H} NMR spectrum of 6.0 equiv. TPO and 

5.0 equiv. 1, with 1.0 equiv. [Ph,PNPPh,][PF,] as the analytical standard. B: “P{{H} 
NMR spectrum of 6.0 equiv. TPO and 5.0 equiv. lin the presence of 0.5 equiv. 
(UO,CI,(THF),],. Cycles 1-6: "P{{H} NMR spectra of charged (blue) and 
discharged (red) solutions. An unknown species begins to appear at 45 ppm 


after multiple cycles. Detailed experimental conditions are givenin Methods. 
b, Plot of integrated values for all P-containing species, obtained from the 
charged spectra versus the charge cycle number. The repeated cycling resulted 
inloss of electrochemically generated 3N (average loss of 15.6% per cycle) due 
to presumed chloride migration over the anion-exchange membrane. There 
was little change in the yield of TPO (average loss of 0.3% per cycle) with larger 
losses in1 (average loss of 7.2% per cycle) and 4N (average loss of 3.4% per cycle). 
The values of per cent loss per cycle were estimated from the calculated 
trendlines by taking the ratio of the slope versus the y-intercept values. 
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Extended Data Fig. 7 | Stepwise procedure for the biphasic electrochemical 
capture and release of UO,”* (yellow U). For simplicity, only half of the H-cell is 
displayed here. For the full cell design, see Extended Data Fig. 8c, d.a, »P{'H} 
NMR spectrum of DCE layer containing only land [Bu,N][PF.] ([PF,] resonance 
not shown) before charging. b, Top inset, UV-Vis spectrum of aqueous phase 
containing 1.25 equiv. UO,” (from UO,(NO,),(THF),) before mixing with the 
DCE phase. Bottom inset, “P{1H} NMR spectrum of DCE layer containing 2b 
(major) and 1 (minor) after charging 1 (from a) galvanostatically. c, Mixing of 
the phases inb for 2h. d, Top inset, UV-Vis spectrum of aqueous phase after 
mixing with the DCE phase, revealing approximately 0.35 equiv. UO,”* 
remaining. Bottom inset, “P{{H} NMR spectrum of the DCE layer after mixing 


50 45 40 35 30 25 20 


with aqueous phase, showing captured products 3N/4N (major) and1 (minor). 
e, PH} NMR spectrum of the DCE layer following phase separation and 
galvanostatic discharge. A broad peak is observed at 38 ppm, which we 
attribute to an adduct of UO,” with 1. This, together with the broadened peak 
of 1, accounts for -75% of products. The remaining unknown byproducts are 
marked with # or *.f, Top inset, UV-Vis spectrum of aqueous phase after 
addition of fresh buffer to the discharged DCE solution (in e) and mixing for 
12h. The spectrum reveals the release of approximately 0.50 equiv. UO,”". 
Bottom inset, “P{‘H} NMR spectrum of the DCE layer after mixing with fresh 
buffer, showing the free carborane 1 (major), as well as unknown byproducts at 
44 ppm and 20 ppm marked by * (-20% of total). 
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Extended Data Fig. 8 | Diagrams for mono- and biphasic electrochemical 
cells. a, Schematic of the divided H-cell used for the monophasic galvanostatic 
bulk electrolysis cycling experiments with UO,”*. b, Photograph of the divided 
H-cell used for the monophasic galvanostatic bulk electrolysis cycling 
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experiments with UO,”".c, Schematic of the two-compartment H-cell used for 
the biphasic electrochemical capture and release of UO,”*.d, Photograph of the 
two-compartment H-cell used for the biphasic electrochemical capture and 
release of UO,”". 
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Extended Data Fig. 9 | P{‘H} NMR spectra for biphasic electrochemical sodium acetate buffer. c, *P{H} NMR spectrum of an electrochemically 
capture and release of UO,”*. a, *P{{H} NMR spectrum of electrochemically oxidized DCE layer containing 3N/4N following extraction of UO,” into 0.1M 


reduced 1in DCE to produce 2b. b, »P{’H} NMR spectrum of DCE layer following sodium acetate buffer. Minor unknown byproducts (marked by asterisks) are 
UO,” capture from the aqueous layer containing UO,(NO;),(THF), in 0.1M also observed. Detailed experimental conditions are found in Methods. 
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Extended Data Fig. 10 | UV-Vis spectra for the biphasic electrochemical 
capture and release of UO,” and controls. a, Initial UV-Vis spectrum of 
UO,(NO;)3(THF), (0.042 g, 0.078 mmol, 0.026M, 1.25 equiv.) in3 ml of 0.1M 
sodium acetate buffer at pH 5.4 (blue). UV-Vis spectrum taken after mixing the 
aqueous layer with the DCE layer of electrochemically reduced 1 (to generate 
2b) ina0.1M[Bu,N][PF,] DCE solution for 2h, indicating a residual 
concentration of 0.0073M, consistent witha total quantity of captured UO,”* 
to the DCE layer of 0.056 mmol (red). b, UV-Vis spectrum of 0.1M aqueous 
sodium acetate buffer layer at pH 5.4 after mixing for 12 hwith 
electrochemically oxidized 3N/4N in DCE. The concentration of UO,” was 
calculated to be 0.010 M, consistent with a total quantity of 0.031 mmol of 
released UO,” from the DCE layer to the aqueous phase. c, Control for UO,”* 
migration from water to DCE inthe absence of carborane (1or 2a/b). Initial 
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UV-Vis spectrum of UO,” in 0.1M sodium acetate-buffered solution at pH5.4 
(blue). UV-Vis spectrum of aqueous layer after mixing for 4h with DCE solution 
containing [Bu,N][PF,] (0.1M; red). d, Corresponding UV-Vis spectrum of DCE 
layer after mixing for 4 h with the aqueous layer containing UO,”* showninc. 
e, Control for UO,”* migration from water to DCE in the presence of neutral 
carborane (1). Initial UV-Vis spectrum of UO,”* (1.0 equiv.) in 0.1M sodium 
acetate-buffered solution at pH 5.4 (blue). UV-Vis spectrum of aqueous layer 
after mixing for 3 h with DCE solution containing [Bu,N][PF,] (0.1M) and 

1(1.0 equiv.; red). f, Corresponding UV-Vis spectrum of DCE layer containing 
lafter mixing for 3 h with the aqueous layer containing UO,” shown ine. The 
UO,” extinction coefficient was experimentally determined to be 

7.715 Lmol'cm™ (460 nm) at pH 5.4. See Methods for experimental details. 
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Functionalization of the B-C-H bonds of aliphatic acids is emerging as a valuable 
synthetic disconnection that complements a wide range of conjugate addition 


reactions’ >. Despite efforts for B-C-H functionalization in carbon-carbon and 
carbon-heteroatom bond-forming reactions, these have numerous crucial 
limitations, especially for industrial-scale applications, including lack of mono- 
selectivity, use of expensive oxidants and limited scope® ’. Notably, the majority of 
these reactions are incompatible with free aliphatic acids without exogenous 
directing groups. Considering the challenge of developing C-H activation reactions, 
it is not surprising that achieving different transformations requires independent 
catalyst design and directing group optimizations in each case. Here we report a Pd- 
catalysed B-C(sp*)-H lactonization of aliphatic acids enabled by amono-N-protected 
B-amino acid ligand. The highly strained and reactive -lactone products are versatile 
linchpins for the mono-selective installation of diverse alkyl, alkenyl, aryl, alkynyl, 
fluoro, hydroxyl and amino groups at the B position of the parent acid, thus providing 
aroute to many carboxylic acids. The use of inexpensive tert-butyl hydrogen peroxide 
as the oxidant to promote the desired selective reductive elimination from the Pd(Iv) 
centre, as well as the ease of product purification without column chromatography, 
render this reaction amenable to tonne-scale manufacturing. 


Alkyl carboxylic acids are ubiquitous and inexpensive reagents in 
organic chemistry; as such, they are favourable substrates for C-H 
activation reactions*». The scope of such transformations is often 
limited by the incompatibility of certain reaction partners. Indeed, 
for C-C bond formations, alkylation reactions are limited to primary 
alkyl iodide or alkyl boron coupling partners® °; olefination reactions 
are applicable only to electron-deficient olefins*”’; alkynylation reac- 
tions are limited to silyl acetylene bromide"; and arylation reactions 
are compatible only with aryl iodides, but not with the more practical 
aryl bromides and chlorides”, despite the design of various directing 
groups. Most importantly, carbon-heteroatom (C-Y) bond-forming 
reactions (such as fluorination, hydroxylation and amination) based 
onB-C-H activation of free aliphatic acids have not yet been realized. 
Considering these persistent limitations of the conventional B-C-H 
activation approach, we turned toa one-for-all B-lactonization strategy 
(Fig. 1a). B-Lactones are strained heterocycles that have received consid- 
erable attention as valuable synthetic intermediates in the syntheses of 
natural and unnatural products’. Owing to their inherent ring strain, 
they readily react with a wide range of nucleophiles by either acyl C-O 
or alkyl C-O bond cleavage. The lack of precedent of this reaction is 
probably due to the highly unfavoured four-membered lactonization 
transition state’. Notably, this B-lactonization could providea strategy 
to synthesize carboxylic acids containing «-quaternary centres that are 
inaccessible by conjugate addition chemistry, and difficult to prepare 
via a-substitution’®. 

Amixture of K,PtCl,, (17 mol%) and K,PtC1, (33 mol%) can promote the 
formation of y-lactones from aliphatic acids in 16% yield, accompanied 


by 2% B-lactone’”"®, y-Lactonization of benzylic C-H bonds has also been 
reported using Pd and Pt catalysts’””°. These observations indicate that 
B-lactonization is a highly disfavoured process. Guided by previous 
work using a bystanding oxidant to promote C-H activation/cycliza- 
tion reactions”, we investigated catalysts and conditions to achieve a 
B-C-H lactonization reaction. Compared to B-lactam formation, where 
anucleophilic directing group can be employed to forma strong C-N 
bond”3”*, B-C-H lactonization poses an additional challenge because 
of the low nucleophilicity of the carboxylic acid, the strain generated 
in forming a four-membered ring and the facile ring opening under 
C-H activation conditions. Most problematically, Pd(Iv) intermediates 
could readily undergo conventionally favoured reductive elimina- 
tion to produce non-cyclic C-O bond-formation products, such as the 
most common competing pathways acetoxylation and alkoxylation 
(Fig. 1b). We selected 2,2-dimethylbutyric acid la as a model substrate 
inour search for reactivity witha wide range of oxidants and catalysts. 
Exploratory studies using various common oxidants for Pd(11)/Pd(Iv) 
chemistry—such as PhI(OAc),, K,S,O, and F* reagents—consistently 
gave undesired non-cyclic oxidation products (see Supplementary 
Information Table 3 and section ‘Mechanistic studies’ for details). 
To avoid the undesired reductive-elimination pathway, we tested the 
sterically bulky oxidant tert-butyl hydrogen peroxide (TBHP)*, as well as 
PdCl,-derived catalysts, because the ‘BuO and Clanions are less prone 
to reductive elimination due to sterics and electronics. The desired 
B-lactone 2a was formed in 15% 1H NMR (nuclear magnetic resonance) 
yield using a combination of Pd(CH,CN),CI,, TBHP oxidant, CSHCO, 
and 1,1,1,3,3,3-hexafluoro-2-propanol (HFIP) solvent. Encouragingly, 
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Fig. 1| B-C(sp*)-H functionalization. a, Lactonizationas a general and scalable route to B-C(sp*)-H functionalization. b, Challenges: multiple reductive- 


elimination pathways of Pd(Iv) centres. Nu, nucleophile (acid, solvent); L, ligand. 


noy-lactone or B-, y-hydroxylated products were observed during the 
reaction. The unique role of TBHP in favouring B-lactone formation 
can be rationalized on the basis of studies on the oxidation of Pd(11) 
to Pd(1v) by benzoyl peroxide” and TBHP*. Following the oxidation of 
Pd(II) to Pd(Iv) by TBHP, ‘BuO and HO" bound toa Pd(Iv) centre are less 
likely to undergo rapid reductive elimination due to the strong Pd-O‘Bu 
(OH) bond. According to the principle of organometallic chemistry, the 
steric hindrance of ‘BuO’ could also enhance the reductive elimination 
of the carboxylate from the substrate to generate B-lactone product. 


In light of the recent advances in ligand-accelerated Pd(II)-catalysed 
C-Hactivation”®, we next searched for ligands that could substantially 
improve the reactivity of the catalyst. It is also possible that an appropri- 
ate ligand could enhance the otherwise unfavoured B-lactonization. 
Using the mono-N-protected a-amino acid (MPAA) ligand N-acetyl 
glycine L1, the yield was improved to 36%. Modification of the backbone 
of the a-amino acid ligand led only to minor improvements (L2to LS). 
Considering the challenging reductive elimination of a strained four- 
membered ring from Pd(Iv), we reasoned that switching the ligand 
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Br. 
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2z, 59% n= 2, 2aa, 68% 2ac, 51% 2ad, 82% n=8, 2ae, 69% 2ag, 40% 


n= 3, 2ab, 90% 


Fig. 2| Aliphatic acid scope for B-C(sp*)-H lactonization. Conditions for 2ato 
2s and 2wto 2z:1(0.1mmol), Pd(CH;CN),ClI, (10 mol%), L11 (20 mol%), CSHCO, 
(0.5 equiv.), TBHP (about 5.5 Min decane) (2.0 equiv.), HFIP (1.0 mL), 60 °C, 12h. 


n=4, 2af, 32% 


Conditions for 2t to 2v and 2aa to 2ag: 1(0.1 mmol), Pd(OAc), (10 mol%), L11 
(20 mol%), NaOAc (1.0 equiv.), TBHP (about 5.5 Min decane) (2.0 equiv.), 
HFIP (1.0 mL), 60 °C, 12h. See Supplementary Information for details. 
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Fig.3|Gram-scale B-C(sp*)-H lactonization of Gemfibrozil with 1 mol% 

Pd and diverse transformations. Nu, Grignard reagents (3a to 3j), 
alkynylaluminum reagent (3k), TBACN (31), TBAF (3m), MgBr, (3n), NaN; (30), 
NaNHNs (3p), KOH (3q), PhSNa (3r). Conditions for 3a to 3j: 2v (0.1 mmol), 


binding mode from five- to six-membered chelation will increase 
the bite angle, thereby favouring the desired reductive elimination. 
The B-amino acid-derived ligand N-acetyl B-alanine L6 under the same 
conditions improved the yield to 48%. Building on this promising find- 
ing, we then investigated the influence of substituents on the ligand’s 
side chain. Substituents at the B position slightly reduced the reactivity 
(L7 toL10), suggesting that steric hindrance around the NHAc moiety 
was detrimental to reactivity. Moreover, substitution at the a position 
proved beneficial (L11to L13), with methyl-substituted L11 giving 65% 
yield. The isolated yield of B-lactone could be further improved to 
73% when using TBHP in decane (see Supplementary Information for 
further optimization). 

After determining the optimized ligand and conditions, we explored 
the scope of this methodology (Fig. 2). Aliphatic acids containing 
a-gem-dimethyl groups with various aliphatic chains including cyclobu- 
tanes (2f) were all compatible, affording the B-lactones (2a to 2f) in 
high yields. A range of functionalities—such as fluoro (2g), chloro (2h), 
trifluoromethyl (2i), ketone (2j) and phosphoric ester (2k)—were toler- 
ated, with halogen (2h), ketone (2j) and phosphoric ester (2k) moieties 
serving as useful synthetic handles for subsequent derivatization. The 
lactone products containing a piperidine (21) or atetrahydropyran 
(2m) motif are especially valuable. Different protecting groups on 
the hydroxyl group including simple methyl (Me) (2n), benzyl (Bn) 
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Conditions for 3k to 3r: 2v (0.1 mmol), Nu (2.0 or 3.0 equiv.), 1-12 h. 
See Supplementary Information for details. 


(20), and methoxymethyl (MOM) (2p) were also well tolerated. Phenyl 
(2q to 2r) and phenyl ether (2s to 2v) groups were compatible with the 
TBHP system, and remained intact despite the potentially reactive aryl 
or benzylic C-H bonds. A range of substituents on the aryl ring from 
electron-donating (Me and O-alkyl) to electron-withdrawing (chloro, 
bromo and nitro) groups were all well tolerated. Gemfibrozil (1v), an 
oral drug used to lower lipid levels”, was converted to the correspond- 
ing B-lactone 2v in high yield. This lactone could serve as a versatile 
intermediate for library construction in medicinal chemistry (see 
below). Notably, the remaining a-methyl group from the above cases 
could then undergo further C-H functionalizations to afford greater 
structural diversity. Tertiary aliphatic acids containing a single a-methyl 
group (2w to 2ab) consistently afforded useful yields, in addition to 
those substrates containing a-hydrogens (2ac to 2ag). 

To demonstrate the scalability and practicality of this transforma- 
tion, we conducted a gram-scale B-lactonization of Gemfibrozil (1v) 
with 1 mol% Pd (Fig. 3). Pure product was obtained by a simple aqueous 
wash without chromatography. 1.0 g gemfibrozil (Iv) in HFIP, Pd(OAc), 
(1.0 mol%), commercially available MPAA ligand L6 (2.0 mol%) and 
NaOAc (1.0 equiv.) were added to a reaction tube, followed by TBHP 
(70% in water) (2.0 equiv.). After stirring at 60 °C for 24 h, the HFIP 
solvent was removed by evaporation, followed by dissolution with 
ethyl acetate and washing with saturated NaHCO, solution to remove 


unreacted acid, ligand and metal complex. Evaporation of solvent deliv- 
ered the lactone product 2vin 92% yield. Froma practical standpoint, 
this reaction has several key advantages over other C-H activation 
protocols: (1) the inexpensive oxidant TBHP (US$5 per mole); (2) it 
tolerates air and moisture; (3) it can be reliably scaled up; (4) the aque- 
ous wash delivers the product without chromatography. 

As depicted in Fig. 3, the B-lactone product 2vis a stepping stone for 
mono-selective installation of a range of alkyl, alkenyl, aryl, alkynyl, 
cyano, halogen, amino, hydroxyl and thiophenyl groups”? °. Various 
alkyl (3a to 3e), alkenyl (3f to 3g) and aryl (3h to 3j) Grignard reagents 
in the presence of catalytic copper were able to successfully open the 
B-lactone to build new C-C bonds at the B position of the parent ali- 
phatic acids”*”’. In particular, secondary alkyl structure motifs, such 
as isopropyl (3c), cyclopropyl (3d) and cyclopentyl (3e), could be effi- 
ciently installed; by contrast, the analogous secondary alkyl iodides are 
usually incompatible in Pd-catalysed C-H alkylation reactions. B-Vinyl 
aliphatic acids (3f to 3g) were directly accessible through reaction with 
their corresponding vinyl (3f) and isopropenyl (3g) Grignard reagents, 
which provided a strategy complementary to the Pd-catalysed B-C-H 
olefination of free acids and their derivatives, where only electron- 
deficient olefins were effective. B-Lactone 2v may also be expediently 
elaborated into the corresponding B-arylated aliphatic acids (3h to 3j); 
this approach is particularly crucial in the case of 3i and 3j, because 
the corresponding iodides are often not viable coupling partners. 
The use of Grignard reagents prepared from readily available aryl bro- 
mides or chlorides also provides a practical advantage. Additionally, 
B-phenylacetylene aliphatic acids 3k could be successfully synthesized 
from B-lactone 2v on treatment with alkynyl aluminium reagent. Cya- 
nide could also open the lactone to construct anew C-C bond, afford- 
ing the corresponding B-cyano aliphatic acids 3l. The electrophilicity 
of the B-lactone carbonyl was further exploited by the addition of the 
weak fluoride nucleophile (3m) to introduce a CH,F fragment, a highly 
sought-after bioisostere in medicinal chemistry. By a similar B-lactone 
opening, MgBr, delivered the formally B-brominated aliphatic acid 
3n in high yield, a versatile linchpin for further elaboration. Further 
manipulations of the B-lactone inthe presence of the hard nucleophiles 
NaN, or NaNHNsafforded coveted §-amino acid scaffolds (30 to 3p) in 
consistently high yields. By making use of the B-lactone as a masked 
aldol adduct, mild hydrolysis afforded the B-hydroxyl acid 3q in high 
yield. Finally, the formal B-chalcogenation product 3r was obtained in 
near quantitative yield using thiophenol sodium salt as a nucleophile. 
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Methods 


General procedure for B-C(sp°)-H lactonization 

Pd(CH,CN),CLI, (10 mol%, 2.6 mg), ligand L11(20 mol%, 2.9 mg), CSHCO, 
(0.5 equiv., 9.7 mg) and carboxylic acid 1(0.1mmol), inthat order, were 
weighed in air and placed in a culture tube with a magnetic stir bar. 
Then HFIP (1.0 mL) and TBHP (about 5.5 M in decane) (2.0 equiv., 36 LL) 
were added. The reaction mixture was stirred at room temperature 
for 3 min and then heated to 60 °C for 12h (600 rpm). After cooling to 
room temperature, the mixture was concentrated in vacuo, and the 
resulting mixture was purified by preparative thin-layer chromatog- 
raphy or diluted with ethyl acetate and washed with saturated NaHCO, 
solution. The full experimental details and characterization of the new 
compounds can be found in Supplementary Information. 
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Sea-level histories during the two most recent deglacial-interglacial intervals show 


substantial differences’ * despite both periods undergoing similar changes in global 
mean temperature*” and forcing from greenhouse gases°. Although the last 
interglaciation (LIG) experienced stronger boreal summer insolation forcing than the 
present interglaciation’, understanding why LIG global mean sea level may have been 
six to nine metres higher than today has proven particularly challenging”. Extensive 
areas of polar ice sheets were grounded below sea level during both glacial and 
interglacial periods, with grounding lines and fringing ice shelves extending onto 
continental shelves®. This suggests that oceanic forcing by subsurface warming may 


also have contributed to ice-sheet loss 


Antarctic?" 


°-? analogous to ongoing changes in the 


and Greenland* ice sheets. Such forcing would have been especially 


effective during glacial periods, when the Atlantic Meridional Overturning Circulation 
(AMOC) experienced large variations on millennial timescales”, with a reduction of 


the AMOC causing subsurface warming throughout much of the Atlantic basin 


9,12,17 


Here we show that greater subsurface warming induced by the longer period of 
reduced AMOC during the penultimate deglaciation can explain the more-rapid sea- 
level rise compared with the last deglaciation. This greater forcing also contributed to 
excess loss from the Greenland and Antarctic ice sheets during the LIG, causing global 
mean sea level to rise at least four metres above modern levels. When accounting for 
the combined influences of penultimate and LIG deglaciation on glacial isostatic 
adjustment, this excess loss of polar ice during the LIG can explain much of the relative 
sea level recorded by fossil coral reefs and speleothems at intermediate- and far-field 


sites. 


The evolution of the climate over the last two terminations (T-II, 
about 136-129 ka; T-I, about 19-11.7 ka) shares a number of similari- 
ties (Extended Data Fig. 1). Proxy records of ocean circulation show 
that the last two terminations were accompanied by large reductions 
in the Atlantic Meridional Overturning Circulation (AMOC). Climate 
responses to these reductions show the characteristic bipolar see-saw 
due to reduced northerly ocean heat transport and the weakening of 
the Asian monsoon due to the cooling of the Northern Hemisphere. 
Other similarities include an increase in the rate of sea-level rise when 
the AMOC begins to decrease and the occurrence of a Heinrich event 
during the period of reduced AMOC. Similar climate changes accom- 
panied earlier terminations over the last 640 kyr (ref. '’), suggesting 
that areduced AMOC is a characteristic feature of these periods of 
rapid deglaciation. 

There are also several notable differences between the last two ter- 
minations (Extended Data Figs. 1 and 2). First, proxy data suggest that 
the AMOC during T-II remained in a reduced state for ~7 kyr before 


recovering at the start of the LIG. By contrast, during T-1, the AMOC 
remained weak only for ~3.5 kyr before recovering to nearly full strength 
during the 1.5-kyr Bolling—Allergd warm interval. It then decreased 
again during the 1.2-kyr Younger Dryas cold interval, with its final 
recovery at the start of the present interglaciation. Second, sea level 
rose ~130 m during the 7-kyr sustained ‘one-step’ T-II period of reduced 
AMOC, whereas it rose only ~70 m during the ~6.5-kyr ‘two-step’ T-I 
period of reduced AMOC;, with the remaining ~60 m occurring after 
the start of the present interglaciation, 11.7 ka (Fig. 1). Third, ice-rafted 
debris (IRD) suggests that Heinrich event 11 (H11), whichis nearly twice 
as long as Heinrich event 1 (H1), was sourced from more than just the 
Hudson Strait Ice Stream, which was the primary source for HL”. 
Atransient simulation of T-I climate used an atmosphere-ocean gen- 
eral circulation model (the National Center for Atmospheric Research 
Community Climate System Model version 3; NCAR CCSM3) forced 
by changes in insolation, CO,, ice sheets and freshwater (FW) fluxes 
that, while not in full agreement with reconstructions, were designed 
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Fig. 1|Sea-level change and climate forcings during the penultimate (left) 
and last (right) deglaciations. a, g, Eustatic sea-level record based on benthic 
foraminifera isotopic records (blue line with louncertainty) (a) andina 
compilation of sea-level proxies shown by black dots with depth-range 
uncertainties (g) (see Methods, Extended Data Figure 2). Also shown are RSL 
records based on Red Sea isotopes (grey crosses) placed onarevised age 
model, and U-series-dated corals in Tahiti (sky blue circles), Huon Peninsula 
(light blue-green circles) and western Australia (blue circles) (Methods, 
Extended Data Figure 2). All of the U-series ages have been recalculated and 
normalized with the same set of decay constants for?“*U and”°Th. Error bars 
show 2oage uncertainty. b, h, Insolation on 21June for 65° N (data from ref.’). 
c,i, Change in radiative forcing (R) from GHGs (CO,, CH,, and N,O)°. The 
uncertainty (shaded envelope) is the square root of the sum of squares of the 
uncertainties of the individual GHGs. d,j, Tropical (23.5° N-23.5° S) mean 
annual SST anomalies with 2 s.d. (shaded envelope) relative to the HadISST1.1 
1870-1889 data**. e, k, Changes in the model maximum AMOC transport 
(below 500 m; black line) and temperature (colour scale) as a function of time 
and depth at 45°N, 30° W relative to 140 ka (e) and 22 ka (k)” as simulated by 
NCAR CCSM3 (Methods). f, I, Evolution of temperature (colour scale) asa 
function of time and depth at 70° S, 45° W relative to 140 ka (f) and 22 ka (1)"” as 
simulated by NCAR CCSM3 (Methods). 


to cause the two-step reduction of the AMOCY” (Fig. 1k). The simu- 
lation successfully captured many aspects of the climate evolution 
through T-1 as recorded by proxy records’””°”. Subsurface warming 
throughout much of the Atlantic basin was one of the responses toa 
reduced AMOCY (Fig. 1k, 1), supported by proxy temperature records 
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Fig. 2 | Sea-level change and climate forcings during the last and present 
interglaciations. a, g, Records of relative and global mean sea level (Methods, 
Extended Data Fig. 2). The uncertainty (bars) onthe coral data (circles) is 20. 
b,h, Insolation on 21June for 65° N (data from ref.’).c, i, Radiative forcing from 
GHGs°. The uncertainty is the square root of the sum of squares of the 
uncertainties of the individual GHGs. d,j, Global mean annual SST anomalies 
with 2s.d. relative to the HadISST1.11870-1889 data*”. e, k, Evolution of 
temperature asa function of time and depth at 45° N, 30° W relative to 140 ka (e) 
and 22 ka (k)"”” as simulated by NCAR CCSM3 (Methods). f, I, Evolution of 
temperature asa function of time and depth at 70° S, 45° W relative to 140 ka (f) 
and 22 ka (I)"” as simulated by NCAR CCSM3 (Methods). 


from intermediate-depth (1,000-1,500 m) North Atlantic core 
sites”. 

We used the same climate model to conduct a transient simulation 
that spans T-I] and the LIG (140-115 ka) (Methods). We applied a FW forc- 
ing consistent with reconstructions that reproduced the 7-kyr one-step 
reduction in the AMOC suggested by proxy records of ocean circula- 
tion (Extended Data Figs. 1 and 3) to quantify the associated changes 
in subsurface temperatures during T-II and into the LIG and thus allow 
a direct comparison with the subsurface warming simulated for T-I. 

Figure 1 compares forcing of ice-sheet surface mass balance from 
insolation, greenhouse gases (GHGs) and low-latitude Pacific sea sur- 
face temperatures (SSTs) for T-IIand T-I to representative examples of 
the simulated oceanic forcing at sites in the North (45° N, 30° W) (Fig. le, 
k) and South Atlantic (70° S, 45° W) (Fig. If, 1). Changes in GHG concen- 
trations and SSTs are similar during the two terminations, with increases 
of -2W m” from GHGs and -2 °C of warming from low-latitude Pacific 
SSTs, which strongly influence Northern Hemisphere ice-sheet surface 
mass balance”. Despite these similarities, sea level reached modern 
levels by the end of T-II, while it remained ~50% below modern levels at 
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Fig. 3 | Simulations of the Greenland and Antarcticice sheets. a, The 
Greenland Ice Sheet at the PGM. Shown are the Greenland ice-core sites North 
Greenland Eemian Ice Drilling (NEEM), Greenland Ice Core Project (GRIP) and 
Greenland Ice Sheet Project (GISP). b, The Greenland Ice Sheet at 116 ka. The 
logarithmic colour scale shows the velocity fields and contours on the ice sheet 
indicate the surface elevation (m).c, Change in the thickness of the Greenland 
Ice Sheet between 116 ka and the present day. Contours on the ice surface and 
the colour scale show the thickness change. d, The Antarctic Ice Sheet at the 


the end of T-I (Fig. 1a, g). Some have attributed the faster rate of sea-level 
rise during T-II to the greater boreal summer insolation forcing’, but 
that forcing only exceeds that of T-I after the majority (~80 m) of T-II 
sea-level rise had occurred (Fig. la, b, Extended Data Fig. 2). Otherwise, 
insolation forcing during the first 8 kyr of each termination is similar 
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PGM.e, The Antarctic Ice Sheet at 116 ka. The logarithmic colour scale shows 
the velocity fields and contours on the ice sheet indicate the surface elevation 
(m).f, Change in the thickness of the Antarctic Ice Sheet between 116 kaand the 
present day. Contours on the ice surface and the colour scale show the 
thickness change. g, Contributions of the Greenland Ice Sheet (GrIS) and 
Antarctic Ice Sheet (AIS) to global mean sea level between 140 ka and 116 ka. 
WAIS, West Antarctic Ice Sheet. 


(~55 Wm”), whereas the associated 80 m of sea-level rise during T-Ilis 
much greater than that during T-1(~35 m). 

Given the similar radiative forcing and temperature changes dur- 
ing the last two terminations, we attribute this contrast in sea-level 
response to the greater subsurface warming during T-II associated with 
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Fig. 4| Predictions of RSL at four far-field sites. a, RSL data from the Bahamas 
based on well-dated corals compared with predicted RSL from our simulated 
LIG loss from the Greenland and Antarctic ice sheets and the LAM ice history” 
(solid green line) and the HYB ice history (solid blue line) (see Methods). Also 
shownare predictions of RSL using just the LAM ice history” (dashed green 
line) and the HYB ice history (dashed blue line). b,c, Asina, for Western 
Australia (b) and the Seychelles (c). Horizontal error bars are for age (20), 
vertical error bars are for elevation (in metres; below data points) and coral- 
depth habitat (in metres, above data points). d, RSL data from Mallorca based 
onspeleothem records. Horizontal error bars are for age (20) and vertical error 
bars for the growth range of the speleothem. The Earth models used in the 
calculations are characterized by a lithospheric thickness and upper and lower 
mantle viscosity of: 140 km, 0.3 x 107! Pas, 8.0 x 10? Pas (a),96km, 
0.3x107'Pas, 5.0 x10” Pas (b), 30 km, 0.5 x10”! Pas, 3.0 x 10” Pas (c) and 

120 km, 2.0 x 107!Pas, 8.0 x 10” Pas (d). Each of the coral records comprise data 
collected from multiple sites and the RSL predictions are shown for the 
following representative locations: 24.05° N, 285.47° E (a), 21.97°S, 113.93° E 
(b), 4.28°S, 55.73° E (c) and 39.61° N, 3.38° E (d). The consistency between the 
data and the predictions would be unaffected if we plotted RSL histories at 
each location that accounted for the spatial distances between individual 
sample collection sites. 


the one-step reduction in the AMOC than the T-I two-step reduction, 
leading to greater oceanic forcing of marine ice-sheet margins in the 
North (Fig. le, k, Extended Data Fig. 4) and South (Fig. 1f, 1, Extended 
Data Fig. 4) Atlantic. The Eurasian Ice Sheet during the Penultimate 
Glacial Maximum (PGM, ~140 ka) was also larger than during the 
Last Glacial Maximum (LGM, ~21 ka), with most of the excess mass 
located in low-lying areas south-southeast of the glaciated Barents 
and Kara seas” that was also marine-based owing to isostatic depres- 
sion (Extended Data Fig. 5). We thus hypothesize that collapse of this 
large marine-based ice complex triggered by oceanic forcing would 
also have contributed to the rapid T-II sea-level rise, with the associated 
IRD contribution to H11 diluting the contribution from the Hudson 
Strait Ice Stream”. This greater FW flux from deglaciating Northern 
Hemisphere ice sheets during T-II provided an important positive 
feedback on that deglaciation through its influence on the AMOC and 
subsurface temperatures. 

Figure 2 compares forcings during the last two interglaciations. Peak 
global mean SSTs were similar (Fig. 2d, j) whereas LIG radiative forcing 
from CO, was only slightly higher (~0.25 W m7”) than during the present 
interglaciation (Fig. 2c, i). The main difference is in the higher boreal and 
lower austral summer insolation forcing during the LIG (Fig. 2b, h). Mod- 
elling studies show that this forcing would cause excess mass loss from 
the Greenland Ice Sheet during the LIG, but the estimated 1-3 mincrease 
in global mean sea-level equivalent (GMSLE) is too small to explainthe 
LIG highstand, thus requiring a contribution from the Antarctic Ice 
Sheet”. However, lower austral summer insolation forcing during the 
LIG (Fig. 2b) results in surface cooling over most of Antarctica. This 


suggests that oceanic forcing plays an important role, with warming 
hypothesized to originate from an AMOC reduction during the LIG” 
or from a lagged ice-sheet response to warming from a change in the 
strength and/or position of the Southern Ocean westerlies associated 
with the T-II AMOC reduction’. One ice-sheet model simulates up to 
6.7 m of sea-level rise when specifying a uniform increase in Southern 
Ocean temperatures of 3 °C (ref."). 

Our transient climate simulation shows that T-II oceanic forcing in 
the Southern Ocean, as well as the North Atlantic, continued into the 
early LIG (Fig. 2e, f, Extended Data Fig. 4). We use the Parallel Ice Sheet 
Model (PISM) to assess the response of the Antarctic and Greenland ice 
sheets to this oceanic and surface forcing through T-II and into the LIG 
as simulated by our transient climate run (Methods). The Greenland 
Ice Sheet starts to deglaciate from its PGM extent when adjacent ocean 
temperatures begin to warm at ~137.5 ka (Figs. le, 3g). It reaches its 
present extent at 131.5 ka and then loses an additional 0.88 m GMSLE 
by 119.5 ka, largely through oceanic forcing of those sectors of the ice 
sheet that remain marine-based, causing draw-down of the ice-sheet 
interior (Fig. 3c). The majority (3.42 m) of the total sea-level rise (3.88 m) 
occurs between 136-129 ka (Fig. 3g), corresponding to the period of 
rapid rise in global mean sea level (GMSL) (Fig. 1a). Sensitivity tests in 
which ocean temperatures are held constant at either PGM or LIG values 
showthat the simulated deglaciation is controlled entirely by oceanic 
forcing (Methods, Extended Data Fig. 6), supporting our hypothesis 
that oceanic forcing contributed to deglaciation of other Northern 
Hemisphere ice sheets (Extended Data Fig. 4). 

Our simulations also show that the major deglacial phase of the Ant- 
arctic Ice Sheet from its PGM extent closely coincides with the onset of 
warming of adjacent ocean temperatures at ~137.5 ka induced by the 
slowdown in AMOC (Figs. If and 3g). In particular, the ice sheet retreats 
to its present extent at ~128 ka, with the majority (6.25 m) of the total 
(6.65 m) sea-level rise occurring during the rapid T-II rise in global sea 
level (Fig. 3g). Sea-level rise then begins to slow at 128 ka, followed by 
an acceleration starting at 126.5 ka; a total of 2.99 m of LIG sea-level rise 
occurred by 116 ka (Fig. 3g). The majority of this LIG deglaciation is asso- 
ciated with collapse of the Amundsen Sea sector of the West Antarctic 
Ice Sheet (Fig. 3e) largely in response to oceanic forcing (Extended Data 
Fig. 6), similar to what is suggested by observed recent changes and 
projected for future ice-sheet recession in this area”"*. This destabiliza- 
tion leads to ice-sheet retreat that continues after the period of peak 
oceanic forcing at a rate that is determined largely by the retrograde 
gradient of the bed beneath the West Antarctic Ice Sheet, followed by 
a slowing of this retreat as the Southern Ocean cools (Fig. 2f). 

We next apply anice-age sea-level model” to predict how our simu- 
lated changes in LIG ice-sheet mass would be recorded at three widely 
distributed sites with well-dated corals that provide minimum esti- 
mates of relative sea level (RSL) during the LIG*”, and a speleothem 
record that bounds RSL during the same period”® (Methods). Of the 
five adopted ice histories, the two that are based on studies that use 
a considerably larger Eurasian Ice Sheet during the PGM relative to 
the LGM’ (Lambeck (LAM) and Hybrid (HYB); see Methods) pre- 
dict RSL histories during the LIG that are consistent with the elevation 
determined from the corals from the Bahamas, Western Australia and 
the Seychelles (Fig. 4a—c, Extended Data Fig. 7). However, all simula- 
tions tend to underestimate the first half (before 122 ka) of the LIG RSL 
evolution inferred from the speleothem record in Mallorca (Fig. 4d, 
Extended Data Fig. 7). 

In the absence of melting of polar ice during the LIG, predictions 
of RSL at the Bahamas and Mallorca would show a monotonic rise, 
while those at the Seychelles and Western Australia would tend to show 
a monotonic fall (Fig. 4)”°. Our ice-sheet simulations, however, are 
characterized by excess melt from the West Antarctic Ice Sheet (rela- 
tive to the present day) that increases from 0 to 3m GMSLE between 
127 kaand124 ka (Fig. 3g). This signal is responsible for the accentuated 
RSL rise over the same period in the prediction for the Bahamas and 
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Mallorcaand the reversal inthe RSL trend at Western Australia and the 
Seychelles (Fig. 4). 

Our results do not account for several other processes that may have 
caused LIG global mean sea level and RSL to have been even higher 
than modelled here. For example, our atmospheric modelling may 
underestimate surface melting around the lower-elevation margins of 
the Greenland Ice Sheet. The greater boreal summer insolation forcing 
during the LIG relative to the present interglaciation probably caused 
even greater mass loss from glaciers, which today account for 0.41m 
GMSLE. Warmer-than-present LIG temperatures would have caused 
additional thermosteric sea-level rise®. Finally, we note that any addi- 
tional melt near the start of the LIG would change the preferred Earth 
models identified in our analysis (Methods). LIG RSL could also be 
influenced by dynamic topography due to mantle convection, intro- 
ducing metre-scale displacement on timescales of tens of kiloyears”’. 

Insummary, several lines of evidence suggest that the greater oceanic 
forcing during T-II, as simulated by our climate modelling, contributed 
tothe more-rapid sea-level rise during that period than T-1. First, forcing 
of ice-sheet surface mass balance was similar during the two termina- 
tions, indicating that an additional forcing is required to explain the 
differences in rates of sea-level rise. Second, our glacio-isostatic adjust- 
ment modelling demonstrates that the larger PGM Eurasian ice sheet 
caused a substantially larger fraction of the ice-sheet bed to be below 
sea level, andthus more vulnerable to oceanic forcing, than during T-I. 
Third, our ice-sheet modelling shows that >85% of the volume loss of the 
PGM Greenland and Antarctic ice sheets to their present sizes occurs 
in response to oceanic forcing during T-II. Although additional model- 
ling of the deglaciation of the former Northern Hemisphere ice sheets 
during T-II will be required to further support this hypothesis, our ice- 
sheet modelling does show that oceanic forcing was the primary driver 
of excess ice loss from the Greenland and Antarctic ice sheets during 
the LIG. Our sea-level predictions demonstrate that the modelled 4m 
increase in GMSLE from LIG deglaciation of the Greenland and Antarctic 
ice sheets may explain much of the LIGRSL at intermediate- and far-field 
sites when GIA from T-II deglaciation is included, although additional 
melt is required to fully reconcile these data. 
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Methods 


Transient climate modelling 

We use the fully coupled configuration of CCSM3 with T31 model spec- 
tral resolution (approximately 3.75°)” for the transient simulation of 
the penultimate deglaciation and LIG from 140 to 120 ka. CCSM3 was 
used in the TraCE-21K transient simulation of the past 21 kyr spanning 
the last deglaciation and the current interglaciation””!". The transient 
simulation of the penultimate deglaciation was initialized with a 600- 
yr equilibrium simulation of the PGM that branched off the TraCE-21K 
LGM simulation with an orbital configuration’ and GHG contribution 
(CO,) for 140 kyr (ref. **). The transient simulation of the penultimate 
deglaciation with CCSM3 was integrated from 140 kato 129 ka with vary- 
ing atmospheric GHG concentrations”, Earth orbital configurations’ 
and continental ice sheets based on ICE-5G”, but with the timing of the 
corresponding sea-level rise adjusted to closely follow the Waelbroeck 
et al.!and Grant et al.** sea-level reconstructions for the penultimate 
deglaciation (Extended Data Figs. 2 and 3). We note that our sea-level 
modelling suggests that the sizes of the PGM Northern Hemisphere 
ice sheets differed from those in the LGM, and thus our climate model- 
ling does not account for these important difference between the two 
terminations. Further climate modelling is therefore needed to assess 
how these differences may have affected both atmospheric circulation 
over the North Atlantic Ocean and the AMOC. 

To simulate the impact of FW forcing from H11 on the AMOC, fresh 
water is added at the surface of the North Atlantic in the area between 
50 and 70° N and ramped to 0.17 Sv (sverdrup, 1 Sv =10° m’s*) from 
138 kato 135.5 ka; it then remains constant until 129.7 ka when it is shut 
off (Extended Data Fig. 3). The transient simulation of the LIG with 
CCSM3 was integrated from 129 ka to 116 ka with changing orbits and 
atmospheric GHG concentrations under present-day ice-sheet con- 
figurations. No additional FW fluxes were applied during the transient 
simulation of the LIG. 


Ice-sheet modelling 
We use version 0.7.1 of PISM, in which the dynamical core superposes 
velocity fields from the shallow shelf and shallow ice approximations 
across the entire domain. Fast flow (‘streaming’) of grounded ice is 
enabled by plastic failure of subglacial sediments, which depends on 
a prescribed but spatially variable till friction angle that represents 
sediment strength and its degree of saturation. The till friction angle is 
based primarily on topography, so that deeper areas have lower friction 
angles. This mimics the effect of weaker sediments accumulating in 
deeper basins. The parameterization follows the form, phi min/phimax/ 
elevation min/elevation max, in which the phi minis the friction angle 
applied below elevation min, phi max is the friction angle applied above 
elevation max, and values in between are linearly interpolated. For our 
Greenland simulations we prescribe values of 10/30/-300/300, and for 
Antarctica 6/30/-700/-100. These values are based on, but modified 
from previous work (5/40/-700/700 for Greenland*; 10/30/-1000/200 
for Antarctica*°), but these values are uncertain. Our values were cho- 
sen following exploratory simulations that sought to best capture the 
broad-scale geometric and dynamic features of the ice sheets. 
Sediment strength evolves dynamically depending on the basal 
ice temperature. Where ice is sufficiently thick to allow basal melt- 
ing, meltwater weakens the substrate until driving stresses exceed 
till cohesion. Failure of the substrate that results in acceleration of 
the overlying ice follows a pseudo-plastic law’, such that a small 
increase in stress above the shear strength of the substrate leads to 
an increasing velocity response. This ultimately thins the ice, which 
reduces the gravitational driving stress and results in a deceleration 
of the ice sheet. The cyclic behaviour of ice streams that occurs as a 
consequence of this mechanism is described in more detail elsewhere”. 
PISMuses a sub-grid grounding line scheme” in which the interpolation 
of sub-ice shelf melt across the grounded to floating transition may 


be turned on or off. When turned on, the scheme tends to accelerate 
ice-sheet retreat in marine basins, whereas when it is off, the scheme 
produces a slower response”. This difference in behaviour results in 
differences in retreat rates, but equilibrium states (such as ice volume) 
are less affected. In our experiments we investigated both approaches, 
and found that interpolating sub-shelf melt across the grounding line 
produced simulations that were more consistent with geological con- 
straints for T-I (see below). 

We also used a range of enhancement factors for the shallow ice 
(SIAe) and shallow shelf (SSAe) equations (SIAe = 1, 2, 3; SSAe = 0.5, 1), 
and different values for the basal sliding exponent that controls how 
plastic or linear the substrate deformation response to applied driv- 
ing stresses (q = 0.25, 0.6). Floating ice is controlled by two calving 
mechanisms: one is based on horizontal strain rates“ and the other 
prescribes aminimum thickness criterion (50 m for Greenland, 200 m 
for Antarctica). 

We run separate simulations for the Greenland and Antarctic ice 
sheets, both at 20-km resolution. To drive our ice-sheet model, we use 
output climatologies from the transient CCSM3 simulations described 
above for T-I and T-II. Atmospheric outputs are applied as anomalies to 
present-day air temperature and precipitation fields”, in the same 
manner previous studies**. We employ a positive-degree day (PDD) 
model to translate temperatures above freezing into surface melt, of 
which 60% remains in the snowpack as a consequence of refreezing 
during percolation. The proportion of refreezing that takes place even 
under present conditions is difficult to constrain precisely, so we use 
auniform value both for the control and perturbation experiments to 
minimize the effects of this parameterization. That is, differences in 
the simulation outputs are unlikely to arise from uncertainty in this 
aspect of the model parameterization. 

During our model tuning process, we explored a wide range of 
degree-day values (from 1mm °C“ day‘ up to 64 mm °C day*) inde- 
pendently for both snow andice. We tried the more usual melt threshold 
of 273 K and the lower value of 270 K following van den Broeke et al.**. 
The latter method yields more widespread melt, mimicking the possible 
melt arising from shortwave radiation under sub-freezing conditions, 
and thus degree-day factors are typically lower*®. We also allow for 
stochastic variability in daily temperatures using a zero-mean white 
noise component with a standard deviation set at 5 K. Although the 
choice of PDD parameters did exert some control on the geometry 
of the evolving ice mass, the basic shape of the ice sheet evolved ina 
similar manner regardless of either the melt forcing or the glaciological 
parameterization, suggesting that the dominant control on ice-sheet 
geometry is the climate forcing from the our general circulation model. 
Recent work has shown that our simulations of surface mass balance 
(SMB) of the Greenland Ice Sheet (GrIS) under the high boreal summer 
insolation of the LIG may be sensitive to climate model resolution and 
SMB model type (that is, PDD, surface energy balance)”. 

Oceanic fields for temperature and salinity at 500 m depth were 
used as inputs to a thermodynamic ocean model that calculates basal 
melt from salt and heat-flux gradients across the ice-ocean interface, 
according to the scheme described in ref. #8. As with the atmospheric 
variables, we apply the oceanic fields as anomalies from a present-day 
ocean configuration that for Antarctica is tuned to reproduce observed 
melt-rate patterns”. As such constraints are not currently available 
for Greenland, we use a spatially uniform melt factor instead, whichis 
iteratively refined so that both LGM and present-day ice-sheet extents 
are reproduced (see below). Ice thickness and bed topography for the 
two ice sheets are taken from the most recent compilations. 

With the model set-up as described above, we ran a series of time- 
evolving experiments that first focused on T-I, rather than T-II. The 
rationale for this approach is that substantial geological data exist 
with which to constrain the evolving ice-sheet geometry through the 
last deglaciation, whereas there are few constraints for the preceding 
T-II. Therefore, to optimize our parameter settings, we undertook >500 
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experiments for T-I for both ice sheets until a good fit to empirical 
constraints was found. For Antarctica, our guiding constraints are 
that the ice sheet at the LGM, immediately before T-I, should occupy 
the majority of the continental shelf and have an ice-volume excess 
above the present that is within the range of 5.6-14.5 m from previous 
simulations >. Furthermore, we required that the evolution of the 
simulated ice sheet must reproduce the glacial maximum thickening in 
West Antarctica and thinning in East Antarctica inferred from ice-core 
analyses», and exhibit a pattern of mass loss that is consistent with 
geologically inferred deglacial changes in ice discharge”. In Greenland, 
geological constraints on the offshore extent of the LGM ice sheet are 
sparse, but the ice volume excess is thought to have been in the range 
of 2-5 m GMSLE°”*®. We use this range as our target (Extended Data 
Fig. 8). Finally, both ice sheets are required to reproduce present-day 
grounded ice extent and volume as closely as possible at the end of 
the T-I simulations. 

Once this phase of parameter optimization is complete, we run our 
experiments for T-II using the same settings, changing only the input 
climatology based on outputs from CCSM3. This dual approach allows 
the robust simulation of a period, suchas T-Il, for which few data exist to 
constrain outputs. In addition, this methodology allows for the direct 
comparison of model outputs for the two periods, allowing any dif- 
ferences to be attributed solely to the imposed climate forcing rather 
than to uncertainties in the modelling procedure. Finally, by tuning the 
model to fit relatively well-known constraints such as LGM and present- 
day extent and volume, we reduce the influence of any inaccuracies in 
the climate model representation of air or ocean temperatures during 
the periods of simulation. Thus, if CCSM3 under- or overestimates 
the magnitude of past climate anomalies with respect to the present, 
the internal consistency between the T-I and T-II climate simulations 
coupled with the data-constrained simulation of T-I mean that the 
reliability of the T-II simulation is unaffected. 

Anovelty of our ice-sheet simulations compared with previous stud- 
ies’ is that we use a fully evolving T-I experiment to constrain our 
model parameterizations. This includes components such as degree- 
day factors for the PDD scheme. For Greenland, we run an ensemble of 
tuning experiments that explore a range of snow and ice melt factors 
as well as ice-flow enhancement coefficients (Extended Data Fig. 8). By 
then selecting the parameterization that at the end of the T-I simula- 
tion most closely reproduces present-day ice volume and geometry we 
ensure that the surface melt fields we generate are realistic. We then 
apply this set-up to our T-Il experiments. Our annual temperature range 
is defined by the CCSM3 outputs. However, we also experimented with 
duplicate simulations in which we modified our Greenland climatolo- 
gies to incorporate summer temperatures from Fausto et al.””. These 
simulations resulted in only minor differences in mass change, suggest- 
ing that atmospheric forcing plays a lesser role than oceanic forcing 
in our experiments (Extended Data Figs. 6 and 9). This is supported 
by experiments in which we also explored alternative grounding line 
schemes to make the ice sheets either more or less sensitive to ocean 
temperature change. In the less-sensitive experiments, the ice sheet 
failed to advance sufficiently far offshore, and was thus incompatible 
with geological constraints. 


Predictions of relative sea level 

Calculations of glacial isostatic adjustment described in the text are 
based ona pseudo-spectral sea-level theory” for the case of spherically 
symmetric Maxwell viscoelastic Earth models (that is, rheology varies 
with depth alone), witha truncation at spherical harmonic degree and 
order 256. The theory incorporates time-varying coastlines, changes 
inthe perimeter of grounded, marine-based ice sheets and the impact 
onsea level of load-induced perturbations to the Earth’s rotation axis, 
where these perturbations are computed using the rotational stability 
theory of Mitrovica et al... Profiles of the density and elastic structure of 
the Earth model are taken from the seismic Preliminary Reference Earth 


Model™. The viscosity structure of the Earth models is defined by three 
layers: a lithospheric zone of infinite viscosity and sub-lithospheric 
upper and lower mantle regions, where the boundary between the 
latter two regions is taken to be 670 km depth. The thickness of the 
lithosphere and the viscosity of the upper and lower mantle are free 
parameters and are varied, respectively, within the following ranges: 
30-140 km; 2-20 x 10”° Pas; and 2-100 x10” Pas. 

The set of five ice histories adopted in this study is based, in part, 
on histories constructed by Dendy et al.’ in their investigation of the 
sensitivity of LIG sea level predictions to variations in the timing and 
geometry of ice cover during Marine Isotope Stage 6. We begin by sum- 
marizing these ice histories. 

All models in Dendy et al.”8 use the ICE6G ice history® for the period 
extending from the LGM to present day and they extend back four 
full glacial cycles. The models are constrained to have interglacial ice 
volumes and geometry identical to present-day ice cover onthe Earth 
(that is, there is no excess ice melting during previous interglaciations, 
including the LIG). The so-called Waelbroeck (WAE) ice model adopts 
the eustatic sea-level curve estimated by Waelbroeck et al.' on the basis 
of benthic foraminifera isotope records. In the period before the LGM, 
the ice geometry is constrained to be identical to the geometry post- 
LGM whenever the eustatic values are identical. The LAM and Colleoni 
(COL) models in Dendy et al.” also adopt the pre-LGM eustatic curve 
of Waelbroeck et al.’, but are distinguished from WAE by their ice his- 
tory during the penultimate glacial cycle. In particular, these models 
adopt the ice geometry during the PGM inferred by Lambeck et al.” and 
Colleoniet al.”’, which are both characterized by more substantial ice 
cover over Eurasia during the PGM than the LGM. As the difference in 
peak Eurasian ice volume during the PGM inthe LAM and COL models 
is large (55 mand 71m GMSLE, respectively), we have constructed an 
intermediate ice history (HYB) that is essentially the average of these 
models (peak volume of 66 m GMSLE during the PGM) The increased 
ice cover of the LAM, HYB and COL models relative to the WAE modelis 
compensated, in large part, by a reduction of the volume of the Lauren- 
tide Ice Sheet during the PGM relative to the LGM” All four models, 
WAE, LAM, HYB and COL, converge to the same ice geometry (that is, 
the present-day ice geometry) at the beginning of the model LIG. We 
note that we have adapted the WAE, LAM and COL models described by 
Dendy et al.”’ to more closely follow the eustatic curve of Waelbroeck 
et al.’. Finally, the model SHA in Dendy et al.”’ is constructed in a man- 
ner identical to WAE, with the exception that the model adopts the 
eustatic curve derived by Shakun et al. in the period before the LGM. 

The ice histories considered in the present study combine the five 
models described above with the Antarctic Ice Sheet (AIS) and GrIS 
histories discussed in the main text. Specifically, the difference in ice 
height during the period from 140 ka to 116 ka relative to the present 
day in the ice-sheet simulations of the main text are applied to each of 
the Dendy et al.”8 models. The net result is that the five models con- 
structed in this manner are characterized, in contrast to those in Dendy 
et al.8, by excess melting of the AIS and GrlS during the LIG relative to 
the present day. We ran 337 Earth models for each of the S ice histories 
(total of 1,685 simulations) in which parameters defining the Earth 
model were varied over plausible ranges. 

In exploring the fit of the RSL predictions to the coral record, we 
considered three sites that have the largest datasets of well-dated corals 
(the Bahamas, Seychelles and Western Australia) and a relatively new 
speleothem data set from Mallorca” (Extended Data Fig. 7). Given that 
corals provide a minimum bound onsea level, our metric for fit for these 
data was the number of coral records that any specific RSL prediction 
bounded from above. By contrast, we interpret the height uncertainties 
associated with the published speleothem data to represent a two-way 
bound on peak RSL. 

None of the 1,685 simulations (that is, our sampling of 337 Earth 
models and Sice histories) were successful in bounding all coral records 
from above. As an indication of performance, Extended Data Fig. 7 


shows predictions from the full suite of simulations that satisfy the 
following criteria: (1) all coral data from Western Australia and the 
Bahamas, with the exception of the earliest datum at the latter site (at 
~131 ka), fall below the prediction; and (2) the prediction at the Sey- 
chelles falls above all three coral records at an elevation of -4 m. The 
various lines on the figure represent the different Earth models for each 
ice history that satisfy these constraints. For each ice history (that is, 
each column of Extended Data Fig. 7), the Earth models sampled on 
each frame (that is, each site) represent a discrete set that may or may 
not overlap with the set from a different site. As an example, in the 
case of the LAM and COL ice histories, no single Earth model appears 
on the results for all three sites. This is reflected in Fig. 4, where the 
simulation highlighted in each frame is the result for a distinct Earth 
model. This variation is justified by the fact that the Earth’s mantle is 
subject to large amplitude variations in viscoelastic structure and so 
the sea-level response at each of the three sites would not be expected 
to favour the same Earth model. 

Note that the number of simulations that satisfy our plotting criterion 
for the Bahamas increases as one moves to ice histories with larger 
Eurasian ice cover at the PGM (that is, fromthe WAEtothe COL results), 
but the number of simulations that satisfy the criteria for the Seychelles 
decreases in the same sense. While not apparent from Extended Data 
Fig. 7, the predicted highstand at Seychelles increases as one considers 
Earth models with progressively thinner elastic lithospheres (see figure 
9A of ref. 8), and the simulations that predict RSL highstands above the 
Seychelles records are those based ona lithospheric thickness of 30 km 
(asin Fig. 4) or, ina couple of cases for the WAE ice history, 50 km. This 
raises two important issues. First, none of the simulations that yield 
RSL above all the coral elevations at the Seychelles also satisfy the geo- 
logical constraints at the Bahamas. Second, as the predicted highstand 
at the Seychelles is sensitive to the adopted lithospheric thickness, 
there is a trade-off between the preferred value of this parameter and 
the level of excess melting during the LIG. That is, increasing polar ice 
sheet melting above the ~-4 m GMSLE adopted in the simulations in 
Extended Data Fig. 7 would increase the range of lithospheric thick- 
ness that would satisfy the Seychelles coral record, and thus bring the 
inference into better accord with other GIA-based estimates of this 
Earth model parameter. 

This issue may also be relevant to the results for Mallorca (Extended 
Data Fig. 7), where simulations are plotted only ifthe misfit between the 
GIA predictions and the speleothem observations is within 50% of the 
minimum misfit achieved in all simulations. In this case, fewer of the 
simulations provide a reasonable fit to the speleothem record as one 
considers ice histories with progressively larger volumes over Eurasia 
at PGM; no simulations based on the COLice history satisfy our plotting 
criterion. However, regardless of the adopted ice history, none of the 
simulations fit the highstand constraints before 125 ka. Bringing the 
GIA predictions in Extended Data Fig. 7 into accord with the Mallorca 
observations would require additional excess melting that is limited 
to the earliest phase of the LIG. 

Asa final point, simulations based on the SHA ice history yielded mis- 
fits significantly larger than predictions shown in Extended Data Fig. 7. 

The sea-level simulations described above yield changes in sea level 
and topography at each time slice of the ice history. As an example, 
Extended Data Fig. 5 shows the reconstructed topography for the area 
covered by the Scandinavian Ice Sheet at 131 ka, near the end of the 
Marine Isotope Stage 6 deglaciation, for a simulation based on the 
LAM ice history and a specific Earth model (see caption). The map 
supports the suggestion that the margin of grounded ice complexes 
in this region across MIS6 through Se were marine-based. 


Evidence for warming over the GrIS during the LIG 

Here we evaluate the evidence for warming over the GrIS during the 
LIG. This supports our climate model simulation, which showed that 
while the LIG atmosphere was warmer than in the pre-industrial, it 


largely remained below freezing and did not lead to significant mass 
loss from surface melting. 

Regarding the reconstructed LIG temperatures at the NEEM® and 
GISP2° ice-core sites, there is uncertainty in which d8¥0,,../d7 (where 
Tis the temperature) relationship should be used to reconstruct LIG 
temperatures, and this uncertainty is exacerbated when applying the 
modern d80,,./dT relationship to past climates, where differences in 
orbital forcing, moisture transport pathways, ice-sheet topography 
and sea-ice extent can change the relationship” ”. To illustrate some 
of these uncertainties, we have compared our simulated temperatures 
for the NEEM and GISP2ice-core sites with the temperature reconstruc- 
tions for these sites based on 6'°0,,, (Extended Data Fig. 10). These 
reconstructions span the interval 127-120 ka, which is the warmest 
interval in the ice-core records for the LIG suggested by this proxy. 
The published reconstructed temperatures for GISP2 (blue symbols 
in Extended Data Fig. 10a)°°”? and NEEM (dark blue line on Extended 
Data Fig. 10b)® are based on the relation d5°0,,./d7 = 0.5%o C™, which 
is derived from Greenland ice-core sites elsewhere”. During the LIG, 
the precipitation-weighted 680 is probably biased towards summer 
months rather than mean annual temperature (van de Berg et al.®°), so 
we compare this reconstruction with our simulated summer tempera- 
ture (JJA) (grey line in Extended Data Fig. 10). This suggests that our 
simulated JJA temperatures underestimate the mean of the reconstruc- 
tions by 4-5 °C. This difference is reduced when we account for our 
simulated ice-surface lowering of -200 m at NEEM and ~400 mat GISP2 
(see Fig. 3c) and assume the lapse rate of 7.5 °C km ‘used by Dahl-Jensen 
et al.©, thus placing our results within the published uncertainties of 
the reconstructions (green line in Extended Data Fig. 10). 

However, following the publication of Dahl-Jensen et al.®, Masson- 
Delmotte et al.” established that the d&°0,../d7 relation at the NEEM 
site is ~1.1%0 C“, suggesting that the NEEM and GISP2 LIG summer tem- 
peratures are about half of the originally published values based on 
the Vinther et al.” d6'80,../d7 relation (red symbols in upper panel, red 
line in lower panel of Extended Data Fig. 10). Masson-Delmotte et al. 
(page 1,500 of ref. *) conclude that “For the last interglacial period, the 
observed 60 anomaly of 3.6% at NEEM deposition site would then 
translate into 3.6 + 0.7 °C warming, instead of the estimate of 7.5 +1.8 °C 
(NEEM, 2013) that was obtained using the Greenland average Holocene 
isotope-temperature relationship (Vinther et al., 2009).” 

Our simulated JJA temperatures (grey line in Extended Data Fig. 10) 
are thus only 1-2 °C colder than the mean reconstructions for GISP2 and 
NEEM based on this new calibration, but they are in excellent agreement 
with the mean values when accounting for our modelled ice-surface 
lowering (green line in Extended Data Fig. 10). 

Landais et al.” used 8N from the NEEM core to reconstruct tem- 
peratures that were 8.5° + 2.5 °C warmer during the LIG compared with 
pre-industrial temperatures. However, the SN reconstruction repre- 
sents annual temperature, whereas the 6°O,.. temperatures are biased 
towards the summer (is the critical season for influencing changes in 
surface mass balance through melting). The two temperature recon- 
structions are thus not directly comparable. Moreover, Landais et al.”° 
identify “large uncertainties’ (p. 1944) in their temperature reconstruc- 
tion, including in the firn model used, in the assumed accumulation 
rates and in the potential influence of surface melt on firn depth. 

We thus conclude that when using the most suitable temperature 
calibration for the ice-core sites and within the uncertainties of the ice- 
core proxy reconstructions, our climate model successfully captures 
the LIG summer (JJA) temperature anomaly relative to pre-industrial 
temperatures at NEEM and GISP2. Consistent with this model-data 
agreement for warmer LIGJJA temperatures, we find that the LIG surface 
mass balance of the GrIS is more negative than the present day mass 
balance (Extended Data Fig. 9). 

Dahl-Jensen et al. stated that “during our NEEM field campaigns 
(2007-2012), the mean surface air temperature in July reached —5.4 °C.” 
However, Box” reported the average JJA temperature for 2007-2012 
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at NEEM site as -10.9 + 0.3 °C, suggesting that Dahl-Jensen et al.® are 
reporting a maximum July temperature value during their period of 
record rather than climatology. But the JJA temperature that matters 
for comparison with the LIG are the pre-industrial temperatures, which 
Box” found to be -12.6 + 0.6 °C for 1840-1870 (the record closest to the 
pre-industrial period). (Dahl-Jensen et al.° compared with the average 
of the last millennium.) Thus, even if the Dahl-Jensen et al.® LIG tem- 
perature reconstruction is correct (7.5 + 1.8 °C warmer than the mean 
of the past millennium), average LIG summer temperatures would still 
be well below freezing (approximately —5 °C). Itis more likely, however, 
that they are even further below freezing when using the d5%0,,./dT 
relation established for the NEEM site”, that is, 3.6 + 0.7 °C warmer 
than the mean of the past millennium, with average LIG summer tem- 
peratures thus being —9 °C. 

The evidence for surface melt at the NEEM ice core site is based on: (1) 
alow-resolution record showing that 7 out of 73 samples have elevated 
CH, and N,O during the interval 127—118 ka, and (2) a high-resolution 
CH, record that suggests five melt events in the 123.5-122.5 ka inter- 
val, or one every 200 years®”®. Noble gases that were measured at the 
times of four of the five elevated CH, events in the high-resolution 
record confirm melting during these events”. This alone makes it clear 
that these were infrequent periods of melting rather than continu- 
ous melting throughout the LIG. According to Anais Orsi (personal 
communication), during a melt event, such as the 2012 event”, the 
melt percolates and refreezes in the top 1m of the firn, often in many 
layers, So one melt event may be represented by more than one melt 
layer. Moreover, although the noble gas results clearly identify four 
periods of enhanced melting, one cannot exclude the possibility that 
each sample represents a single 2012-like melt event. 

In summary, ice-core proxies suggest that Greenland LIG tempera- 
tures were warmer than present, but the amount of warming from 
these proxies remains subject to uncertainty. However, even the high- 
est estimates of warming still suggest that average JJA temperatures 
remained well below freezing relative to pre-industrial temperatures, 
and—on the basis of the more-appropriate 5'%0,..-temperature cali- 
bration from Masson-Delmotte et al.*—are in good agreement with 
our simulated temperatures for the ice-core sites. Consistent with 
this model-data agreement for warmer LIGJJA temperatures, we find 
that the LIG surface mass balance of the GrIS is more negative than 
the present day mass balance. Rare episodes of melting occurred, but 
while their frequency may increase under higher mean temperatures 
and insolation (such as those recorded in the Holocene section of the 
GISP2 ice core®°), we conclude that with a frequency of perhaps only 
one melt event every 200 years, they had a negligible influence onthe 
long-term surface mass balance, and average summer temperatures at 
the NEEM site otherwise remained well below freezing during the LIG. 


Influence of FW forcing from modelled mass loss from the GrIS 
and AIS on ocean circulation during the LIG 

We did not include additional FW forcing after 129.5 ka, but we show 
here that the FW fluxes from our modelled mass loss from the GrlIS 
and AIS after 129.5 ka (during the LIG) were too small to have influ- 
enced the AMOC or Antarctic Bottom Water formation. As global sea 
level reached modern levels at 129-130 ka and our modelled AMOC 
resumes at 129.5 ka, we consider only the FW fluxes from the GrIS and 
AIS since 129.5 ka. 

From 129.5 ka to 127 ka, modelled GrIS mass loss was 0.2m of GMSLE, 
which is equivalent to a FW flux of 0.0009 Sv. From 127 ka to 117.5 ka, 
GrIS mass loss was 0.09 m, which is equivalent to 0.0001 Sv. For refer- 
ence, Bakker et al.*' showed that a FW flux of 0.01 Sv from Greenland for 
the RCP 4.5 scenario (see their figure SI3) results ina median reduction 
inthe AMOC of -5% (their Fig. 2, GrIS only). The FW fluxes from LIG loss 
of the GrIS in our model are two orders of magnitude smaller than this, 
and would therefore have no impact onthe AMOC, or consequently on 
our ice-sheet model simulations. 


From 129.5 kato 123.5 ka, AIS mass loss was 4.1 m, which is equivalent 
toa FW flux of 0.008 Sv. Bakker et al.®” found that a FW flux of 0.12 Sv 
from the AIS increases variability in the Antarctic Bottom Water by 
~10% and in AMOC by ~5%. The FW fluxes from LIG loss of the AIS in our 
modelisa factor of 15 smaller than this, and thus would have no impact 
on the Antarctic Bottom Water or the AMOC, or consequently on our 
ice-sheet model simulations. 


Data availability 


Antarctic bedrock topography and ice thickness data are from the 
BEDMAP2 compilation, available at https://secure.antarctica.ac.uk/ 
data/bedmap2/. Greenland topography and ice thickness data are from 
BedMachine v3, available at https://nsidc.org/data/idbmg4. Green- 
land mass balance and geothermal heat flux data are available from 
the seaRISE website: http://websrv.cs.umt.edu/isis/index.php/Data. 
Information on the Antarctic surface mass balance data is available 
at http://www. projects.science.uu.nl/iceclimate/models/antarctica. 
php#racmo23. Antarctic geothermal heat flux data are available at 
the Open Science Framework https://doi.pangaea.de/10.1594/PAN- 
GAEA.882503. The datasets generated and used for this study (Figs. 1-4, 
Extended Data Figs. 3-9) are available from the Open Science Frame- 
work (https://doi.org/10.17605/OSF.1IO/FX7WK). 


Code availability 


CCSM3 is freely available as open-source code from http://www.cesm. 
ucar.edu/models/ccsm3.0/. PISM is freely available as open-source 
code from https://github.com/pism/pism.git. 
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Extended Data Fig. 1| See next page for caption. 


Extended Data Fig. 1| Climate and sea-level records for T-II and T-I. a, eNd 
records from the North Atlantic Ocean as proxies of AMOC transport!**>. b, 
CCSM3 maximum AMOCtransport (below 500 m) (this study). c, EPICA Dome C 
&D record on AICC2012 age modelas proxy of Antarctic temperature* (blue 
line) and percentage of warm planktonic foraminiferal species as proxy of 
North Atlantic sea surface temperatures® (grey line). d, 6'°O record from 
Chinese stalagmite as proxy of Asian monsoonstrength”. e, Rate of sea-level 
change derived from anRSL reconstruction based on benthic foraminifera 
isotopes’. f, A stack of North Atlantic ice-rafted debris records recording 


H11°385-*”, g eNd (ref. 88; brown, orange symbols) and Pa/Th (ref. *°; purple, 
greensymbols, louncertainty) records from the North Atlantic Ocean as 
proxies of AMOC. h, CCSM3 maximum AMOCtransport (below 500 m) (this 
study). i, EPICA Dome C §D record on the AICC2012 age model (dark blue line)™* 
as proxy of Antarctic temperature and atemperature reconstruction from the 
Greenland GISP2 ice core (light blue line)”°. j, 6'°O record froma Chinese 
stalagmite as proxy of Asian monsoon strength". k, Rate of sea-level change 
derived from an RSL reconstruction based on benthic foraminifera isotopes". I, 
Astack of North Atlantic IRD records that log H1”°. 
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Extended Data Fig. 2| Sea-level records for the last two terminations and 
interglaciations. a, Sea-level reconstructions for the penultimate deglaciation 
andthe LIG (the latter identified by the grey-shaded area). The eustatic sea- 
level record is based on benthic foraminifera isotopes (blue line with lo 
uncertainty)’ and the RSL record is based on Red Sea isotopes (grey crosses; 
green line, l-kyr moving Gaussian filter)** placed onarevised age model”. Also 
shownareRSL data from U-series dated corals at Tahiti (sky blue circles)”, the 
Huon Peninsula (light blue-green circle; altered samples shown by grey 
circles)”, the Seychelles (light green circles)”, western Australia (blue circles)* 
and the Bahamas (cyan circles). All of the U-series ages have been recalculated 
to normalize them with the same set of decay constants for7**U and?*°Th 

(ref. **) and are shown with 2oage uncertainty. We note that the offset between 
the Red Sea record (green line) and the benthic foraminifera record (blue line) 
may reflect the complex three-dimensional Earth structure in the vicinity of 
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the Red Sea rift®>®. The variability in the Red Sea and Huon Peninsula RSL 
records may reflect a sea-level reversal at -137 ka (ref. "), which, if it existed, was 
too small to be recorded by the benthic foraminiferal record. The rate of sea- 
level change based on the benthic foraminiferal record is also shown. b, Sea- 
level reconstructions for the last deglaciation and the present interglaciation 
(the latter identified by the grey-shaded area). The record of global mean sea 
level is based on benthic foraminifera isotopes (blue line with louncertainty)'. 
Also shownare individual sea-level estimates (black circles, 2ouncertainty) 
that have been corrected for glacial isostatic adjustment”. The rate of sea-level 
change based on the benthic foraminiferal record is also shown. c, Top, eustatic 
sea-level reconstructions for the penultimate deglaciation (blue line withlo 
uncertainty) and the last deglaciation (black line with louncertainty)’. Bottom, 
21June insolation for 65° N for the penultimate deglaciation (blue line) and the 
last deglaciation (black line)’. 
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Extended Data Fig. 3 | See next page for caption. 


Extended Data Fig. 3 | Comparison of our FW forcing during T-II with other 
estimates. a, Our simulated changes in AMOC. b, Our FW forcing. 

c, Reconstruction of the FW flux from sea-level reconstructions from 
Waelbroeck et al.’. d, Reconstruction of FW flux from sea-level reconstructions 
from Marino et al.*. e, Our stack of IRD for H11 (Extended Data Fig. 1), which 
shows that the H11 interval of iceberg discharge is in good agreement with the 
timing of our FW forcing. f, The sea-level change associated with our FW flux 
into the North Atlantic (grey line), the sea-level change associated with the 
ICE-5G ice sheets® used as a boundary condition in our climate model (green 
line) and a reconstruction of global sea-level change! (blue line with lo 
uncertainty). The timing of sea-level change in the ICE-5G time series shown 
here was adjusted from its chronology for T-I by adjusting the corresponding 
sea-level rise to closely follow the sea-level reconstructions from Waelbroeck 
et al.1and Grant et al.** for the penultimate deglaciation. 
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Extended Data Fig. 4 | Maps of the evolution of temperature at 400 m water 140-128 ka (f), 140-126 ka (g) and 140-124 ka (h). i-p, Maps of the evolution of 
depth in the North Atlantic, Arctic and Southern oceans between 138 ka and temperature at 400 m water depth in the Southern Ocean for 140-138 ka (i), 
124 karelative to the temperature at 140 ka. a-h, Maps of the evolution of 140-136 ka (j), 140-134 ka (k), 140-132 ka (I), 140-130 ka (m), 140-128 ka (n), 
temperature at 400 m water depth in the North Atlantic and Arctic oceans for 140-126 ka (0) and 140-124 ka (p). 

140-138 ka (a), 140-136 ka (b), 140-134 ka (c), 140-132 ka (d), 140-130 ka (e), 
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Extended Data Fig. 5 | Predicted topography for the area covered by the 
Scandinavian Ice Sheet at 131 ka. The calculation is based onthe LAM ice 
history (see text) and an Earth model characterized by a lithosphere of 1OOkm 
thickness, an upper mantle viscosity of 3 x 10°° Pas and alower mantle viscosity 
of 5x10” Pas. The white zone ina represents the coverage of grounded ice 


extent at this time and the dashed white line on this frame is the shoreline 
location. b, As fora, except the area of ice coverage is removed. Itis clear froma 
that all but the southeast section of the perimeter of the Scandinavian ice sheet 
is predicted to be marine-based at this time, and from b that much of the 
interior of theice sheet was also marine-based. 
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Extended Data Fig. 6 | Results of sensitivity tests to oceanic forcing of the line). Present interglacial ice volume shown by the horizontal dashed line. c, As 
GrIS and AIS. a, Response of the GrIS to atmospheric forcing from CCSM3 with ina, but the vertical scale (grounded ice volume) has been increased to better 
fixed ocean temperatures for the PGM (blue line) and LIG (orange line) illustrate the response. The initial ice-sheet size used in this experiment (and 
compared with the ice-sheet response to atmospheric and oceanic forcing the comparable one for Antarctica) was the LIG ice sheet, whereas the climate 
(black line). The present-day interglacial ice volume shown by horizontal forcing used was for the penultimate deglaciation and the LIG: thatis, froma 
dashed line. b, Response of the AIS to atmospheric forcing from CCSM3 with colder-than-present to LIG climate, resulting ina small response to the 
fixed ocean temperatures for the PGM (blue line) and for the LIG (orange line) atmospheric forcing, as the LIG ice-sheet size had already adjusted tothe 


compared to ice-sheet response to atmospheric and oceanic forcing (black combined atmospheric and oceanic forcing, as shown by the black line ina. 
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Extended Data Fig. 7 | Predictions of RSL at three far-field sites (the 
Seychelles, Western Australia and Mallorca) and one intermediate-field site 
(the Bahamas). a—d, RSL predictions for the Bahamas from the full suite of 
simulations that bound from above all of the coral data with the exception of 
the earliest datum (at ~131 ka) for the COL ice history (a), the LAMice history (b), 
the HYBice history (c) and the WAE ice history (d). e-h, RSL predictions for the 
Seychelles from the full suite of simulations that lie above the three coral 
records with an elevation of -4 m for the COL ice history (e), the LAM ice history 
(f), the HYB ice history (g) and the WAEice history (h).i-I, RSL predictions for 
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western Australia from the full suite of simulations that bound from above all of 
the coral data for the COL ice history (i), the LAM ice history (j), the HYBice 
history (k) andthe WAEice history (I). The age uncertainty is 20, andthe depth 
uncertainty reflects the uncertainty in habitat depth. m-p, RSL predictions for 
Mallorca from the full suite of simulations that fit the data within 50% of the 
minimum misfit achieved for all simulations for the COL ice history (m), the 
LAM ice history (n), the HYB ice history (0) and the WAE ice history (p). The age 
uncertainty is 20, and the depth uncertainty reflects the uncertainty in 
speleothem water depth. 
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Extended Data Fig. 8 | Sensitivity of the GrIS model to melt amedium amount of surface melting (c), anda high amount of surface melting 
parameterization. a, Time series of tuning experiments for the GrIS with the (d). Values shown are the differences from the reference experiment. These 
preferred runin blue and three runs used for b-d shown in green, orange, and experiments are identical to the T-I reference experiment used to parameterize 
red. b-d, Surface elevation differences under a present-day climatology at the the T-II simulations (Fig. 3) except for the degree-day factors used. The results 
end of the 40,000-yr T-I parameter tuning experiments, using degree-day show that ourice-sheet model is sensitive to the way in which surface mass 


factors drawn from our ensemble that give alow amount of surface melting (b), | balance is parameterized by controlling the amount of surface melting. 
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Extended Data Fig. 9 | Simulated ice-volume changes and components of the 
mass balance for the GrIS. a, Simulated changes in ice volume for T-I. 
b, Simulated changes in mass-balance components for T-I.c, Simulated 


changes inice volume for T-II. d, Simulated changes in mass-balance 
components for T-II. e, Modelled surface mass balance anomaly during the LIG 
(129-120 ka) with respect to the modelled present-day mass balance. 
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Extended Data Fig. 10 | Comparison of our simulated summer temperature 
for Greenland ice-core sites with the temperature reconstructions for these 
sites based on 8'°0,,,. a, The simulated summer temperature (JJA) (grey line) 
and lapse-rate corrected JJA temperature (green line) compared with 
reconstructed temperatures for the GISP2 ice-core site (blue symbols, 1a 
uncertainty)? based on the relation d6“0,,,/d7 = ~0.5%o C“, which is derived 
from Greenland ice-core sites elsewhere”. Also shownare the reconstructed 
temperatures using the d5¥0,,,/d7 relation established for the NEEM site 
(-1.1%0 C“)’ (red symbols, louncertainty), suggesting that the GISP2 LIG 
summer temperatures are about half of the originally published values based 
onthe Vinther et al.” d6"80,../d7 relation and in good agreement with our 


model results. b, The simulated JJA temperature (grey line) and lapse-rate 
corrected JJA temperature (green line) compared with reconstructed 
temperatures for the NEEM ice-core site (dark blue line, grey shading is 
uncertainty)® based on the relation d5°0,,./d7 = ~0.5%o C' which is derived 
from Greenland ice-core sites elsewhere”. Also shown are the reconstructed 
temperatures using the d5"*0,,,/d7 relation established for the NEEM site 
(-1.1%0 C“)* (red line, pink shading is uncertainty), suggesting that the NEEM 
LIG summer temperatures are about half of the originally published values 
based onthe Vinther et al.“ d6“°O,.,/d7 relation and in good agreement with our 
model results. These reconstructions span the interval 127-120 ka, whichis the 
warmest interval in the ice-core records for the LIG suggested by this proxy. 


Article 


Ancient West African foragers in the context 
of African population history 


https://doi.org/10.1038/s41586-020-1929-1 Mark Lipson", Isabelle Ribot”, Swapan Mallick'**, Nadin Rohland', Ihigo Olalde"”°, 
Nicole Adamski"“, Nasreen Broomandkhoshbacht"*”’, Ann Marie Lawson", Saioa Lopez’, 
Jonas Oppenheimer“, Kristin Stewardson“, Raymond Neba‘ane Asombang®, 

Hervé Bocherens”“, Neil Bradman®”, Brendan J. Culleton®, Els Cornelissen”®, 

Isabelle Crevecoeur", Pierre de Maret", Forka Leypey Mathew Fomine’, 

Philippe Lavachery™, Christophe Mbida Mindzie”, Rosine Orban”, Elizabeth Sawchuk”, 
Patrick Semal"®, Mark G. Thomas®®, Wim Van Neer'©”, Krishna R. Veeramah?°, 

Douglas J. Kennett”, Nick Patterson’, Garrett Hellenthal®, Carles Lalueza-Fox”, 


Scott MacEachern”, Mary E. Prendergast'’”>*° & David Reich'?*72°° 


Received: 27 November 2018 


Accepted: 29 November 2019 


Published online: 22 January 2020 


Our knowledge of ancient human population structure in sub-Saharan Africa, 
particularly prior to the advent of food production, remains limited. Here we report 
genome-wide DNA data from four children—two of whom were buried approximately 
8,000 years ago and two 3,000 years ago—from Shum Laka (Cameroon), one of the 


earliest known archaeological sites within the probable homeland of the Bantu 


language group’ ” 


. One individual carried the deeply divergent Y chromosome 


haplogroup AOO, which today is found almost exclusively in the same region”. 
However, the genome-wide ancestry profiles of all four individuals are most similar to 
those of present-day hunter-gatherers from western Central Africa, which implies that 
populations in western Cameroon today—as well as speakers of Bantu languages from 
across the continent—are not descended substantially from the population 
represented by these four people. We infer an Africa-wide phylogeny that features 
widespread admixture and three prominent radiations, including one that gave rise to 
at least four major lineages deep in the history of modern humans. 


The deposits at Shum Laka, a rockshelter located in the Grassfields 
region of western Cameroon, are among the most important archaeo- 
logical sources for the study of Late Pleistocene and Holocene prehis- 
tory in western Central Africa’ *. The oldest human-occupied layers at 
the site date to about 30,000 calendar years before present (BP; taken 
as AD 1950 in accordance with radiocarbon calibration convention), 
but of special interest are artefacts and skeletons dating to between 
the end of the Later Stone Age (about 8,000 BP) and the beginning of 
the Iron Age (about 2,500 BP) (Extended Data Fig. 1, Supplementary 
Information section 1). This transitional period—sometimes referred 
to as the ‘Stone to Metal Age’—featured a gradual appearance of new 
stone tools, as well as pottery” *. Subsistence evidence in the rockshelter 
during the Stone to Metal Age points primarily to foraging, but with an 


increasing use of the fruits of Canarium schweinfurthiithat coincided 
with developments in material culture and served as a foundation for 
later agriculture’ (Supplementary Information section 1, Supplemen- 
tary Table 1). These cultural changes and their early appearance at 
Shum Lakaare particularly intriguing because during the late Holocene 
epoch, the area around the present-day border between Cameroon 
and Nigeria was probably the cradle of the Bantu language group, and 
of populations whose descendants would spread across much of the 
southern half of Africa between about 3,000 and 1,500 BP, resultingin 
the vast range and diversity of Bantu languages today®™. 

A total of 18 human skeletons have been discovered at Shum Laka, 
comprising 2 distinct burial phases 3 (Supplementary Information sec- 
tion 1). We attempted to retrieve DNA from six petrous-bone samples 
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Table 1 | Details of the four Shum Laka individuals in the study 


Identifier Ageat Date Radiocarbon date Sex Mthaplogroup Yhaplogroup Coverage SNPs Mt and X 
death contamination (%) 
2/SE| 4+1 7,920-7,690 6,985+30(PSUAMS-6307) M LOa2a1 B 0.70 564,164 1.0 and 1.0 
2/SE Il 15+3 7,970-7,800 7,090 +35 (PSUAMS-6308) M LOa2a1 AOO 77 1,082,018 1.5 and 0.6 
A/A 8+2 3,160-2,970 2,940+20(PSUAMS-6309) M Lic2a1b B2b 3.83 935,777 0.3 and 0.5 
5/B 4+1 3,210-3,000 2,970+25(PSUAMS-6310) — F Lic2a1b NA 6.41 1,014,618 0.5 and NA 


Age at death is given in years (mean + s.e.m.), and was determined from skeletal remains”. Sex was determined from genetic data. Date is given in calibrated years BP as a 95.4% confidence 
interval (Methods). Radiocarbon date is given in uncalibrated radiocarbon years before present (mean + s.e.m.), with the laboratory sample code shown in parentheses. Coverage refers to the 
average sequencing coverage, and contamination to the estimated contamination from mtDNA (Mt) or the X chromosome (X). NA, not applicable. Y, Y chromosome. Additional information is 


provided in Supplementary Table 2. 


and obtained working data from two individuals of the early Stone to 
Metal Age and two of the late Stone to Metal Age (about 8,000 and 
3,000 BP, respectively) (Table 1, Supplementary Table 2). The two 
earlier individuals—a boy of 4 +1 years old at time of death (given the 
identifying code 2/SE I) lying ontop of the lower limbs of an adolescent 
male of 15 +3 years old (denoted 2/SE IIl)?—were recovered froma pri- 
mary double burial, and the two later individuals—a boy of 8 + 2 years 
(denoted 4/A) and a girl of 4 +1 years (denoted 5/B)’—were in adjacent 
primary single burials. 

We extracted DNA from bone powder and prepared 2 or 4 libraries 
per individual for Illumina sequencing, enriching for about 1.2 million 
target single-nucleotide polymorphisms (SNPs) across the genome 
(Methods, Supplementary Table 2). Final coverage ranged from 0.7 
to 7.7x (from 0.56 to 1.08 million SNPs). The authenticity of the data 
was supported by the observed rate of apparent C-to-T substitutions 
in the final base of sequenced fragments (4-10%, within the expected 
range given our library preparation strategy”) and of heterozygosity 
for mitochondrial DNA (mtDNA) and for the X chromosome in males 
(estimated contamination 0.3-1.5%). We also generated whole-genome 
shotgun sequence data for individuals 2/SE II (about 18.5x coverage) 
and 4/A (about 3.9x coverage), as well as genome-wide data (about 
598,000 SNPs) for 63 individuals from 5 present-day Cameroonian 
populations (Extended Data Table 1, Supplementary Table 3). 


Uniparental markers and kinship analysis 


Allofthe mtDNA and Y chromosome haplogroups we observe at Shum 
Laka are associated today with sub-Saharan Africans. The two earlier 
individuals carry mtDNA haplogroup LOa (specifically LOa2al), whichis 
widespread in Africa, and the two later individuals carry LIc (specifically 
Lic2a1b), whichis found among both farmers and hunter-gatherers in 
Central and West Africa®”*. Individuals 2/SE 1 and 4/A have Y chromo- 
somes from macrohaplogroup B (often found today in hunter-gatherers 
from Central Africa’), and 2/SE Il has the rare Y chromosome haplo- 
group AOO, which was discovered in 2013 and is present at appreci- 
able frequencies only in Cameroon—in particular, among the Mbo and 
Bangwain the western part of the country”. AOO is the oldest known 
branch of the modern human Y chromosome tree, witha split time of 
about 300,000-200,000 Bp from all other known lineages”"*"”. At 
1,666 positions (from whole-genome sequence data; Supplementary 
Table 4) that differ between present-day AOO* and all other Y chro- 
mosomes, the sequence of the Shum Laka individual carries the non- 
reference allele at a total of 1,521, translating to a within-AOO split at 
about 37,000-25,000 Bp (95% confidence interval) (Fig. 1, Methods). 
Leveraging the effects of chromosomal segments that are shared 
identical by descent (IBD), we computed rates of allele matching for 
each pair of individuals to infer degrees of relatedness. Both of the 
contemporaneous pairs display elevated levels of matching: 2/SE I 
and 2/SE II share alleles at the level of fourth-degree relatives, and 4/A 
and 5/B at the level of second-degree relatives (either uncle and niece, 
aunt and nephew or half-siblings) (Extended Data Fig. 2), supporting 
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archaeological interpretations that—during both burial phases—the 
rockshelter was used as a cemetery for extended families”. We would 
expect more recent shared ancestry for the contemporaneous pairs 
even if they were not closely related, but we observe clear signatures 
of long IBD segments across the genome, which confirms their close 
family relatedness (Supplementary Information section 2). All four 
individuals also have evidence of intra-individual IBD, and thus of recent 
inbreeding. 


PCA and allele-sharing statistics 


We visualized the genome-wide relationships between the Shum Laka 
individuals and diverse present-day and ancient sub-Saharan Africans 
(Extended Data Table 1) using principal component analysis (PCA). 
Initially, we computed axes using East and West Africans and hunter- 
gatherers from southern Africa and eastern Central Africa (Fig. 2a). 
The Shum Laka individuals project to the right of present-day West 
African populations and speakers of Bantu languages (hereafter, Bantu- 
speakers) and are closest to present-day hunter-gatherers from Cam- 
eroon (Baka, Bakola and Bedzan”°) and the Central African Republic 
(Aka, often known as Biaka). We then carried out a second PCA using 
only West and East Africans and Aka to compute the axes, and again 
the Shum Laka individuals project in the direction of hunter-gatherers 
from western Central Africa (Fig. 2b). By contrast, present-day groups 
from western Cameroon, who speak languages from the Niger-Congo 
family, cluster tightly with other West Africans (Fig. 2, Extended Data 
Fig. 3a). In both plots, the two earlier Shum Laka individuals fall slightly 
closer to West and East Africans, but—on the basis of their overall simi- 
larity—we grouped all four Shum Laka individuals together for most 
subsequent analyses. 

Using f-statistics (Fig. 3a), we investigated components of ‘deep 
ancestry’ from sources that diverged earlier than the split between 
non-Africans and most sub-Saharan Africans (above point (2) in Fig. 4a). 
We began with the statistic f,(X, Mursi; ancient South African hunter- 
gatherers, Han), which is expected to be positive if deep ancestry in 
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population X induces allele-sharing between X and ancient South 
African hunter-gatherers”!” (with a baseline of zero set by the Mursi, 
a group of pastoralists from western Ethiopia who speak a Nilotic lan- 
guage”). The Shum Laka individuals showa large positive statistic that 
is comparable to that of hunter-gatherers from western Central Africa 
(Fig. 3a top), whereas other West Africans (for example, Yoruba and 
Mende) yield smaller—but still significantly positive (Fig. 3a)—values, 
as do East African hunter-gatherers (the Hadza from Tanzania, as well 
as the ancient individual from Mota Cave in Ethiopia (hereafter, Mota 
individual), dating to approximately 4,500 BP”’). We also obtained 
consistent results from analogous statistics using different reference 
groups (Extended Data Table 2). 

Next, using chimpanzees as an outgroup that is symmetric to all 
human populations, we computed f,(X, Mursi; chimpanzee, ancient 
South African hunter-gatherers) to evaluate whether any of this deep 
ancestry is from sources that diverged more deeply than southern 
African hunter-gatherers (the modern human lineage with the oldest 
known average split date”**), Previous work has shown that southern 
African hunter-gatherers are not asymmetric outgroup relative to 
other sub-Saharan Africans: West Africans (especially the Mende) have 
excess affinity towards deeper outgroups”. Indeed, our test statistic 
is maximized in Mende and other West Africans (Fig. 3a, bottom). The 
Hadza, and the Mota individual, have values close to zero, andthe Shum 
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Fig. 3 | Allele-sharing statistics. a, Statistics sensitive to deep ancestry 
(mean+2s.e.m., multiplied by 1,000). Blue, deeper than non-Africans; red, 
deeper than southern African hunter-gatherers; computed on 1,121,119 SNPs. 
SA, ancient South African hunter-gatherers; S.L., Shum Laka. b, Relative allele 
sharing (mean +s.e.m., multiplied by 10,000; computed on 538,133 SNPs) with 
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western Central hunter-gatherer group ina comprises the Aka plus Cameroon 
hunter-gatherers (Baka, Bakola and Bedzan). 


Laka individuals and Central African hunter-gatherers are intermediate. 
Some populations yield positive values for bothf,-statistics (Fig. 3a), 
but the two sets are poorly correlated, which implies that they—at least 
in part—reflect separate signals. 

Combining our newly genotyped individuals with published data”’, 
we searched for differential allele-sharing between the Shum Laka indi- 
viduals (compared to either East Africans (Somali) or to the Aka) and 
present-day Cameroonians (Fig. 3b, Extended Data Fig. 3b). We identi- 
fied three distinct clusters: (1) Mada and Fulani, (2) hunter-gatherers 
and (3) other populations who speak languages in the Niger-Congo 
family (shown in close-up in Fig. 3b). Within the third cluster are the 
only groups—Mbo, Aghem and Bafut (all of whom live close to the site 
of Shum Laka today)—with significantly Shum-Laka-directed statis- 
tics in both dimensions, consistent with small proportions of Shum- 
Laka-related admixture (a maximum of about 7-8%) (Supplementary 
Information section 3). 


Admixture graph analysis 

Finally, we built an admixture graph (Fig. 4a, Extended Data Fig. 4, 
Methods) comodelling the Shum Laka, Mota and ancient South African 
hunter-gatherer individuals; present-day Mbuti, Aka, Agaw (speak- 
ers of an Afroasiatic language, from Ethiopia”’), Yoruba, Mende and 
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the Shum Laka individuals versus East Africans (f,(X, Yoruba; Shum Laka, 
Somali); x axis) and versus Aka (f,(X, Yoruba; Shum Laka, Aka); y axis) for 
present-day populations from Cameroon (blue) and southern (Herero, red) and 
eastern (Chewa, orange) Bantu-speakers. See also Extended Data Fig. 3b. 
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multifurcations. Key points are (1) the early modern human split, (2) East 
African divergences and (3) the expansion of ancestry associated with Bantu- 
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Lemande; non-Africans (French); and two outgroups (Neanderthal 
and chimpanzees). We also fit versions of the model using alternative 
SNP ascertainments and additional populations (Hadza, Mbo, Herero, 
Chewa, Mursi, Baka, Bakola, Bedzan, Mada, Fulaniand ancient individu- 
als from Taforalt in Morocco”) and obtained similar results (Extended 
Data Table 3, Supplementary Information section 3). 

Among modern humans, the deepest-splitting branch is inferred to 
be the one that leads to Central African hunter-gatherers, although four 
lineages diverge ina very short span: those that contribute the primary 
ancestry to (1) Central African hunter-gatherers, (2) southern African 
hunter-gatherers and (3) other modern human populations, along with 
(4) a‘ghost’ source that contributes a minority of the ancestry in West 
Africans and the Mota individual. Central African hunter-gatherers sep- 
arate into eastern (Mbuti) and western clades; the latter then branches 
into components represented in Aka and the Shum Laka individuals. 
Next, asecond cluster of divergences involves West Africans, two East 
African lineages (one associated with hunter-gatherers and another 
with agro-pastoralists) and non-Africans (who are tentatively inferred 
to split closest to the Mota individual, but with no deep ghost ancestry). 
Within the West African clade, we identify Yoruba and Mendeas sister 
groups (with Lemandeas an outgroup), and—most basally—a separate 
lineage that contributed to the Shum Laka individuals (64%). A source 
associated with Bantu-speakers (most closely related to Lemande) 
contributes 59% of the ancestry in Aka and 26% in Mbuti (who also 
have ancestry (17%) from a source related to agro-pastoralists of East 
Africa). In a model separating the two pairs of Shum Laka individuals, 
the pair dating to 3,000 BP have about 5% more ancestry related to 
hunter-gatherers from Central Africa (as confirmed by the significantly 
positive statistic f,(Shum Laka 8,000 BP, Shum Laka 3,000 BP; Yoruba, 
Aka) (Z=4.2)) (Supplementary Information section 3). 

We can also obtain a good fit for the Shum Laka individuals in a less- 
parsimonious alternative model using three components, replacing the 
basal West African source with a combination of ancestry from inside the 
clade defined by the other West African populations and from asource 


668 | Nature | Vol577 | 30 January 2020 


Fig. 4).*Proportion not well-constrained. b, Geographical structure. Shaded 
areas denote the hypothesized historical locations of lineages descended from 
split point (1) ina, and branching order is shown for populations descended 
from split point (2) (one ancestry component per population, with leaf nodes at 
sampling locations). The blue star represents Shum Laka (dashed line, possible 
direction of gene flow). 


entirely outside the West African clade (near one lineage that contrib- 
utes to the Taforalt individuals) (Extended Data Fig. 5, Supplementary 
Information section 3). However, two-component models for the Shum 
Laka individuals that have the majority source splitting closer to other 
West or East Africans are rejected (Z=7.1 and Z=3.7, respectively). 

The West African clade is distinguished by admixture from a deep 
source that can be modelled as a combination of modern human and 
archaic ancestry. The modern human component diverges at almost 
the same point as Central and southern African hunter-gatherers and is 
tentatively related to the deep source that contributes ancestry to the 
Mota individual, and the archaic component diverges close to the split 
between Neanderthals and modern humans (Supplementary Informa- 
tion section 3). The signals of deep ancestry in groups related to the 
West African clade (Fig. 3a) can be explained by two admixture events: 
one along the ancestral West African lineage, and a second, smaller 
contribution (about 4%) to Mende from the same source (Fig. 4a). 
Accordingly, f,-statistics testing for ancestry basal to southern African 
hunter-gatherers (Fig. 3a, bottom) are well-correlated with the inferred 
proportions of ancestry from the West African clade (Extended Data 
Fig. 6). We estimate the shared admixture to introduce 10% deep mod- 
ern humanand 2% archaic ancestry, although the first proportion is not 
well-constrained (Extended Data Table 3). An alternative model with 
noarchaic component-—in which the West African clade receives deep 
ancestry froma single source” that splits before point (1) in Fig. 4a—also 
provides areasonable fit to the data (Extended Data Fig. 7, Supplemen- 
tary Information section 3), although it does not account for previous 
evidence of archaic ancestry in sub-Saharan Africans” *. 


Genetics and archaeology at Shum Laka 


Our analyses show that the 4 sampled children from Shum Laka can 
be modelled as admixed with about 35% ancestry related to hunter- 
gatherers from western Central Africa and about 65% from a basal West 
African source, or—alternatively—as a mixture of ancestry related to 


hunter-gatherers plus two additional components, one from inside the 
clade of present-day West Africans and one that splits between East 
and West Africans. The first component plausibly represents ances- 
try that has been present in the area since at least the Later Stone Age 
(prior to 8,000 BP), whereas the second component (or the third inthe 
alternative model) may have its origins farther to the north, given the 
geography and phylogeny of the other populations we studied (Fig. 4b). 
The chronology of the archaeological record at Shum Laka also sug- 
gests a possible northern influence on cultural developments during 
the Stone to Metal Age®”. These developments include (1) changes in 
stone tools (which can be interpreted as a fusion of local tool-making 
traditions of the Later Stone Age with new macrolithic technologies that 
were introduced from the north’), and (2) the appearance of ceramics 
(four sherds have been found in the burial layer associated with the 
early Stone to Metal Age, and more abundant and distinct ceramics 
are found in later Stone to Metal Age deposits), which are potentially 
related to earlier pottery-working traditions in the Sahara and Sahel??. 
Moreover, gene flow from the north before 8,000 BP is plausible in light 
of ashort period of Saharan and Sahelian aridification**’, which could 
have contributed to population movements. Present-day groups in 
northern West Africa and the Sahel have substantial admixture con- 
nected to later migrations*, so identifying the exact source area may 
require additional ancient DNA studies. 

Although the scope of our sampling is limited to two individuals at 
either end of the Stone to Metal Age, the observed genetic similarity 
across aspan of almost 5,000 years—a similarity that is consistent with 
skeletal morphometric analyses—suggests a long-term presence of 
related peoples who used the rockshelter for various activities, includ- 
ing burying their dead (Supplementary Information section 1). How- 
ever, most populations in Cameroon today are more closely related to 
other West Africans than to the group represented by these individuals. 
Present-day hunter-gatherers in Cameroon are also not descended 
substantially from this specific group, as they lack the signal of basal 
West African ancestry (Supplementary Information section 3). We 
do observe elevated levels of allele-sharing between the Shum Laka 
individuals and present-day populations of the Grassfields region, so 
the genetic discontinuity is not absolute. Additionally, the adolescent 
male 2/SE II carried an AOO Y chromosome, which suggests both that 
the concentration of this haplogroup in western Cameroon may have 
along history and that AOO was formerly more diverse (given that the 
Shum Laka sequence falls outside of known present-day variation)”, 
The divergence time of AOO from other modern human haplogroups, 
of about 300,000-200,000 Bp'*”’, could support its association either 
with the component of the ancestry of the Shum Laka individuals that 
is related to Central African hunter-gatherers or with the deep modern 
human portion of their West-African-related ancestry. 

Linguistic and genetic evidence points to western Cameroon as the 
most likely area for the development of Bantu languages and as the ulti- 
mate source of subsequent migrations of Bantu-speakers, and—although 
the regional mid-Holocene archaeological record is sparse—Shum Laka 
has previously been highlighted as a site that was potentially important 
in the early phase of this process! ** ", However, the genetic profiles of 
our four sampled individuals—even by about 3,000 BP, when the spreads 
of Bantu languages and of ancestry associated with Bantu-speakers were 
already underway-—are very different from those of most speakers of 
Niger-Congo languages today, which implies that these individuals are 
not representative of the primary source population(s) that were ances- 
tral to present-day Bantu-speakers. These results neither support nor 
contradict a central role for the Grassfields area in the origins of Bantu- 
speakers, and it may be that multiple, highly differentiated populations 
formerly lived inthe region—with potentially either high or lowlevels of 
linguistic diversity. It would not be surprising if the Shum Laka site itself 
was used (either successively or concurrently) by multiple groups with 
different ancestry, cultural traditions or languages’, evidence of which 
may not be visible from the collection of remains as preserved today. 


Implications for deep population history 


By analysing data from Shum Laka and other ancient individuals in 
conjunction with present-day groups, we gain insights into African 
population structure on multiple timescales. First, we infer a series of 
closely spaced population splits that involve one West-African-related 
lineage and two East-African-related lineages, as well as non-Africans 
(point (2) in Fig. 4a). The geography of the populations involved sug- 
gests the centre of this radiation was plausibly in East Africa (Fig. 4b), 
and estimated divergences of African and non-African populations 
place its date at about 80,000-60,000 Bp™**>. Such an expansion is 
also consistent with mtDNA phylogeography-—specifically the diver- 
sification of haplogroup L3, which probably originated in East Africa 
about 70,000 Bp**”’—and potentially with the origins of clade CT inthe 
Y chromosome tree at a similar time depth’*”’, 

Second, we infer a phase of divergences that involved at least four lin- 
eages early in the history of modern humans (point (1) in Fig. 4a). Recent 
consensus has been that southern African hunter-gatherers, who split 
from other populations about 250,000-200,000 BP, represent the 
deepest sampled branch of modern human variation””*”». Our results 
suggest that Central African hunter-gatherers split at close to the same 
time (or perhaps slightly earlier), and thus that both clades—as well as 
the lineage that would later diversify at point (2) (Fig. 4a)—originated 
as part of a large-scale radiation. 

In addition to the well-characterized deep lineages, we also detect at 
least one deep ghost source that contributed to West Africans and East 
African hunter-gatherers. This signal corroborates previous evidence 
for the Hadza and Sandawe” and for West Africans”, although we find 
that the best fit is a source that splits near the same point as southern 
and Central African hunter-gatherers. Our results are also consistent 
with previous reports of archaic ancestry in African populations”, 
specifically in West Africans. The presence of deep ancestry in the 
West African clade is notable in light of the Pleistocene archaeologi- 
cal record*“°, which is limited but includes Homo sapiens fossils dated 
to about 300,000 BP in northwestern Africa”, as well as an individual 
with archaic features buried about 12,000 BP in southwestern Nigeria 
(the oldest known human fossil from West Africa proper)*”. Middle 
Stone Age artefacts have also been found in parts of West Africa into 
the terminal Pleistocene”, despite the development of Later Stone Age 
technologies elsewhere (as, for example, at Shum Laka). Thus, the avail- 
able material and fossil evidence is concordant with our genetic results 
inindicating long-term African population structure and admixture**, 

Further genetic studies may reveal additional complexities in deep 
human population history, although some early human groups will prob- 
ably remain known only through fossils***°. On the basis of our current 
understanding, the presence of at least 4 modern human lineages that 
diversified about 250,000-200,000 BP and are represented in people 
living today provides further support for archaeological evidence that 
suggests this era was a pivotal period for human evolution in Africa. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Ancient DNA sample processing 

We obtained bone powder from the Shum Laka skeletons (see Sup- 
plementary Information section 1 for more information on the site 
and burials) by drilling cochlear portions of petrous bone samples in 
aclean room facility at the Royal Belgian Institute of Natural Sciences. 
In dedicated clean rooms at Harvard Medical School, we extracted 
DNA using published protocols***”. From the extracts, we prepared 
barcoded double-stranded libraries treated with uracil-DNA glyco- 
sylase (UDG) to reduce the rate of characteristic ancient DNA dam- 
age“**8 in a modified partial UDG preparation that included magnetic 
bead clean-ups’. For the SNP capture data, we used two rounds of 
in-solution target hybridization to enrich for sequences that overlap 
the mitochondrial genome and approximately 1.2 million genome- 
wide SNPs°°**. We then added 7-base-pair indexing barcodes to the 
adapters of each library and sequenced onan Illumina NextSeq 500 
system with 76-base-pair paired-end reads. For individuals 2/SE II and 
4/A, we also generated whole-genome shotgun data from the same 
libraries but without the target enrichment step. Sequencing was 
performed at the Broad Institute on Illumina HiSeq X Ten systems, 
using 19 lanes for 2/SE II (yielding approximately 18.5x average cov- 
erage, including 1,216,658 sites covered from the set of target SNPs 
used in most analyses) and two lanes for 4/A (3.9x average coverage, 
1,158,884 sites covered). 

From the raw sequencing results, we retained reads with no more 
than one mismatch per read pair to the library-specific barcodes. Before 
alignment, we merged paired-end sequences on the basis of forward 
and reverse mate overlaps and trimmed barcodes and adapters. Pre- 
processed reads were then mapped to both the mitochondrial reference 
genome RSRS” and the human reference genome (version hg19) using 
thesamse command with default parameters in BWA (version 0.6.1)°°. 
Duplicate molecules (with the same mapped start and end positions 
and strand orientation) were removed after alignment. We filtered 
the mapped sequences (requiring mapping quality scores of at least 
10 for targeted SNP capture and 30 for whole-genome shotgun data) 
and trimmed 2 terminal bases to eliminate almost all damage-induced 
errors. 

For mtDNA, we called haplogroups using HaploGrep2”. For nuclear 
DNA obtained from SNP capture and for the whole-genome shotgun 
data for individual 4/A, we selected one allele at random per site to 
create pseudohaploid genotypes. For the whole-genome shotgun data 
for individual 2/SE II, we used a previously described reference-bias- 
free diploid-genotype calling procedure”, converting the resulting 
genotypes into a fasta-like encoding that allows for extraction of data 
at specified sites using cascertain and cTools*. We determined the sex 
of each individual by examining the fractions of sequences mapping 
to the X and Y chromosomes”, and we determined Y chromosome 
haplogroups by comparing sequence-level SNP information to the 
tree established by the International Society of Genetic Genealogy 
(http://www.isogg.org). 

To ensure authenticity, we computed the proportion of C-to-T 
deamination errors in terminal positions of sequenced molecules and 
evaluated possible contamination via heterozygosity at variable sites in 
haploid genome regions, using contamMix*’ and ANGSD” for mtDNA 
and the X chromosome (in males), respectively. Observed damage 
rates (4-10%) were relatively low but within the expected range after 
partial UDG treatment”, and apparent heterozygosity rates for mtDNA 
(0.3-1.5% estimated contamination) and the X chromosome (0.5-1.0% 
estimated contamination) were minimal. The molecular preservation of 
the samples is impressive given the long-term warm and humid climate 


at Shum Laka® (which supports a mixed forest-savannah environment, 
at an elevation of about 1,650 m above sea level). 


Radiocarbon dates 

At the Pennsylvania State University (PSU) Radiocarbon Laboratory, 
we generated direct radiocarbon dates via accelerator mass spec- 
trometry (AMS) for the four analysed individuals, using fragments 
of the same temporal bone portions that were sampled for ancient 
DNA. We extracted and purified amino acids using a modified XAD 
process“ and assessed sample quality using stable isotope analysis. C:N 
ratios for all 4 samples fell between 3.3 and 3.4, well within the nominal 
range of 2.9-3.6 that indicates good collagen preservation™. The PSU 
AMS dates were in good agreement with previously reported direct 
dates for different bones from individuals 2/SE II (7,150 + 70 uncali- 
brated radiocarbon years before present, calibrated to 8,160-7,790 BP 
(Oxford Radiocarbon Accelerator Unit sample code OxA-5203)) and 4/A 
(3,045 + 60 uncalibrated radiocarbon years before present, calibrated 
to 3,380-3,010 BP, OxA-5205)!?3*, but on the basis of a (modestly) 
aberrant date® froma rib of individual 2/SEI (Supplementary Table5), 
we restricted our final reported results to the temporal bones. We per- 
formed calibrations using OxCal® version 4.3.2 with a mixture of the 
IntCal13° and SHCal13° curves, specifying ‘U(0,100)’ to allow fora 
flexible combination”, and rounding final results to the nearest 10 
years (Supplementary Information section 1). 


Present-day data 

We generated genome-wide SNP genotype data for 63 individuals from 
5 present-day Cameroonian populations on the Human Origins array: 
Aghem (28), Bafut (11), Bakoko (1), Bangwa (2) and Mbo (21) (Extended 
Data Table 1, Supplementary Table 3). Samples were collected with 
informed consent, with collection and analysis approved by the UCL/ 
UCLH Committee on the Ethics of Human Research, Committee A and 
Alpha. 


AOO Y chromosome split time estimation 

Present-day AOO Y chromosomes are classified into the subtypes AO0a, 
AOOb and AO0c, the divergence times of which from each other have 
not been precisely estimated but are quite recent—perhaps only a few 
thousand years ago”. To estimate the split time of the Shum Laka AOOY 
chromosome from present-day AOO, we called genotypes for individual 2/ 
SEII (from our whole-genome sequence data) ata set of positions at which 
sequences from two present-day individuals with haplogroup AOO* differ 
from all non-AOO individuals. At every subtype-specific site for which we 
had coverage, the Shum Laka AOO carries the ancestral allele. To avoid 
needing to determine the status of mutations as ancestral or derived, we 
considered the entire unrooted lineage specific to AOO (Fig. 1). The total 
time span represented by this lineage is approximately 359,000 years, 
using published values of about 275,000 BP for the divergence of the AOO 
lineage from other modern human haplogroups” and about 191,000 BP 
forthe next-oldest split within macrohaplogroup A”. Witha requirement 
of at least 90% agreement among the reads at each site, we called 1,521 
positions as having the alternative allele (that is, matching the present- 
day AOO and differing from the human reference sequence) and 145 as 
having the reference allele (taking the average of 143 and 147 for the 2 
present-day individuals). The fraction 145/(145 + 1,521) then defines the 
position of the split of the Shum Laka individual along the (unrooted) 
AOOlineage. Split times computed either from all sites (relaxing the 90% 
threshold and using the majority allele), or from additionally requiring 
at least two reads per site, differ from our primary estimate by only afew 
hundred years. To produce a confidence interval, we used the variance in 
the published estimates and assumed an independent Poisson sampling 
error forthe number of observed reference alleles. The final point estimate 
was about 31,000 Bp (95% confidence interval, 37,000-25,000 BP), which 
means that the AOO of the Shum Laka individual (with a sample date of 
about 8,000 BP) cannot be directly ancestral to the present-day subtypes. 
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PCA andallele-sharing statistics 

We performed PCA using smartpca (with the Isqproject and autoshrink 
options)" and computed f,-statistics using ADMIXTOOLS (with stand- 
ard errors estimated via block jackknife over 5 cM chromosomal seg- 
ments)”*. We projected all ancient individuals in PCA rather than using 
them to compute axes to avoid artefacts caused by missing data. In 
each PCA, we also projected a subset of the present-day populations 
to enable controlled comparisons with ancient individuals. In most 
cases, reported f,-statistics are based onthe approximately 1.15 million 
autosomal SNPs from our target capture set. For PCA and for f,-statistics 
testing differential relatedness to the Shum Laka individuals, we used 
autosomal SNPs from the Human Origins array (a subset of the target 
capture set), with some populations in the analyses only genotyped 
onthis subset (Extended Data Table 1). For these latter f,-statistics, we 
excluded for all populations a set of roughly 40,000 SNPs having high 
missingness in the present-day Cameroon data. 


Admixture graphs 

We fit admixture graphs with the ADMIXTUREGRAPH (qpGraph) pro- 
gram in ADMIXTOOLS (with the options ‘outpop: NULL, ‘lambdascale: 
1’, ‘inbreed: YES’ and ‘diag: 0.0001’) ®, using the 1.15 million autosomal 
SNPs from our target capture set by default, and other sets of SNPs 
in alternative model versions as specified. The program requires as 
input the branching order of the populations in the graph and a list 
of admixture events, and it then solves for the optimal parameters of 
the model (branch lengths and mixture proportions) via an objective 
function measuring the deviation between predicted and observed 
values of a basis set of f statistics. From the inferred parameters, poorly 
fitting topologies (including positions of admixture sources) can be 
corrected by changing split orders at internal nodes that appear as tri- 
furcations under the constraints enforced by the input (Supplementary 
Information section 3). 

To evaluate the fit quality of output models, we used two metrics: 
first, alist of residual Z-scores for all f-statistics relating the popula- 
tions inthe graph, and second, acombined approximate log-likelihood 
score. The first metric is useful for identifying particularly poorly fit- 
ting models and the elements that are most responsible for the poor 
fits, and the second provides a means for comparing the overall fits 
of separate models (Supplementary Information section 3). To assess 
the degree of constraint on individual parameter inferences, we were 
guided primarily by the variability across different model versions 
(using different populations and SNP sets) (Extended Data Table 3, 
Supplementary Information section 3), which reflects both statistical 
uncertainty and changes in model-specific assumptions. In our primary 
model, all f-statistics relating subsets of the populations are predicted 
to within 2.3 standard errors of their observed values. 

Initially, we detected a slight—but significant—signal (maximum 
Z=2.5) of allele-sharing between the Shum Laka individuals and non- 
Africans, which we hypothesize is due to a small amount of DNA con- 
tamination. To prevent this effect from influencing our results, we 
included a‘dummy’ admixture of non-African ancestry into the Shum 
Laka individuals (inferred 1.1%, consistent with mtDNA- and X chromo- 
some-based contamination estimates), although model parameters 
without the dummy admixture are also very similar (Extended Data 
Table 3, Supplementary Information section 3). 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The aligned sequences are available through the European Nucleotide 
Archive under accession number PRJEB32086. Genotype data used in 


analysis are available at https://reich.hms.harvard.edu/datasets. Any 
other relevant data are available from the corresponding author upon 
reasonable request. 
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Extended Data Fig. 1| Overview of the site of Shum Laka. The left column iron objects. Column 7 indicates the two burial phases. Column 8 shows 

represents generalized stratigraphy, with radiocarbon dates (uncalibrated climatic reconstructions based on carbon stable isotopes and pollen from 

years before present) shownas red dots onthe yaxis, anddepositsindicatedby organic matter extracted from sediment cores at Lake Barombi Mbo in western 

their archaeological nomenclature. P, S/Si, Pleistocene; T, A, Holocene; Ao, Cameroon (more arid conditions to the left and more humid conditions to the 


Holocene ochre ashy layer; Ag, Holocene grey ashy layer (after ref. °). Columns right®”), along with archaeological eras. IA, Iron Age; LSA, Later Stone Age; 
1-6 display the chronological extents of technological traditions:1, microlithic | SMA,Stoneto Metal Age. RMCA Collection; drawings by Y. Paquay, 
quartzindustry; 2, macrolithic flake and blade industry on basalt; 3, bifaces of composition © RMCA, Tervuren; modified by E. Cornelissen’’. 

the axe-hoe type; 4, pecked grounded adze and arrow heads; 5, pottery; and 6, 
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Extended Data Fig. 2 | Kinship analysis. Mean genome-wide allelic mismatch inbreeding also serves to reduce the rate of mismatches. For 4/A and 5/B, we 
rates for each pair of individuals (blue), as well as intra-individual comparisons can eliminate a grandparent-grandchild relationship because both died as 
(red), are shown. We selected one read per individual at random at each children, and the lack of long segments with IBD sharing on both homologous 
targeted SNP (using all 1,233,013 targeted sites). Monozygotic twins (or intra- chromosomes implies that they are not double cousins (the few ostensible 
individual comparisons) are expected to havea value one-half as large as double-IBD stretches are probably a result of inbreeding (Supplementary 
unrelated individuals; first-degree relatives, halfway between monozygotic Information section 2)). Thus, we can conclude that they were either uncle and 
twins and unrelated individuals; second-degree relatives, halfway between niece (or aunt and nephew) or half-siblings. Bars show 99% confidence intervals 


first-degree relatives and unrelated individuals; and so on. The presence of (computed by block jackknife). 
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Extended Data Fig. 3 | Alternative PCA and allele-sharing analyses. a, Broad- 
scale PCA (differing from Fig. 2a by projecting all present-day Cameroon 
populations; again using 593,124 Human Origins SNPs). Groups shown in blue 
were projected onto axes computed using the other populations. HG, hunter- 
gatherers. The grouping marked W-Cent. HG consists of Akaand Cameroon 
hunter-gatherers (Baka, Bakola and Bedzan). The majority of the present-day 
Cameroon individuals fall in a tight cluster near other West Africans and Bantu- 
speakers. b, Relative allele sharing (mean +s.e.m., multiplied by 10,000, 
computed 0n 538,133 SNPs, as in Fig. 3b) with the Shum Laka individuals versus 


East Africans (f,(X, Yoruba; Shum Laka, Somali); x axis) and versus Aka (f,(X, 
Yoruba; Shum Laka, Aka); y axis) for present-day populations from Cameroon 
(blue points) and southern and eastern Bantu-speakers (Herero in red and 
Chewa in orange). The Mada and Fulani share more alleles with the Shum Laka 
individuals than they do with the Aka, but this is probably asecondary 
consequence of admixture from East or North African sources (as reflected in 
greater allele sharing with Somali individuals) (Supplementary Information 
section 3). Bars show ones.e.m. in each direction. 
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Extended Data Fig. 4 | Primary inferred admixture graph with full (in units of squared allele frequency divergence) are rounded to the nearest 
parameters. Of the approximately 1,200,000 targeted SNPs, 932,000areused __ integer. Allf-statistics relating the populations are predicted to within 2.3 
for fitting (that is, are covered by all populations in the model). Branchlengths standard errors of their observed values. 
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portion of the West African clade, and one that splits atnearlythesamepointas Supplementary Information section 3 provides the full parameters of the 
one of the sources that contributes ancestry to the Taforalt individuals. Branch inferred model. 


0.0025 


0.002 
Mende 
oO 
@ 0.0015 Lemande 
Yoruba 

= Aka 

@ 

& 

é 0.001 Shum Laka 

o 

oO : 

2 Mbuti 

® 0.0005 

Q 

Mota 
0 : 
Mursi 20 40 60 80 100 
-0.0005 ; 
Percent West African-related ancestry 

Extended Data Fig. 6 | Deep ancestry correlation from the West African related to the West African clade (from admixture graph results; the Mota 
clade. An allele-sharing statistic sensitive to ancestry that splits more deeply individual, Yoruba and Lemandeare shifted slightly away from the boundaries 
than southern African hunter-gatherers (f,(X, Mursi; chimpanzee, ancient for legibility). The (relative) allele-sharing rate for Mursiis zero according to the 
South African hunter-gatherers), mean +2s.e.m. from block jackknife, definition of the statistic. 
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(the order of which we cannot confidently assess) but are not meant to 


Extended Data Table 1| Populations in the study 


Population Country Language Date Sample Data Reference 
family size type 
Shum Laka Cameroon ~8000-3000 BP 4/1/1 1240k/DG/SG This paper 
Ancient Malawi HG = Malawi ~8100—2500 BP 7* 1240k [22] 
Mota Ethiopia ~4500 BP 1 SG [23] 
Ancient South South Africa ~2000 BP 3t SG [21,22 
African HG 
Taforalt Morocco ~15,000-14,000 BP_ 6 1240k [26] 
Altai Neanderthal Russia ~120,000 BP 1 DG [78] 
Aghem Cameroon NC Present 28 HO This paper 
Bafut Cameroon NC Present 11 HO This paper 
Baka Cameroon NC Present 2 DG [20] 
Bakoko Cameroon NC Present 1 HO This paper 
Bakola Cameroon NC Present 2 DG [20] 
Bangwa Cameroon NC Present 2 HO This paper 
Bedzan Cameroon NC Present 2 DG [20] 
Fulani Cameroon NC Present 2 DG [20] 
Lemande Cameroon NC Present 2 DG [25] 
Mada Cameroon AA Present 2 DG [20] 
Mbo Cameroon NC Present 21 HO This paper 
Ngumba Cameroon NC Present 2 DG [20] 
Tikar Cameroon NC Present 2 DG [20] 
Agaw Ethiopia AA Present 2 DG [20] 
Aka (Biaka) Central African NC Present 20/2 HO/DG [22, 25] 
Republic 
Chewa Malawi NC Present 11 HO [22] 
Dinka Sudan NS Present 7/4 HO/DG [22, 25] 
French France IE Present 3 DG [25] 
Hadza Tanzania KS Present 5(2)/1 HO/DG [22, 25] 
Han China ST Present 4 DG [25] 
Herero Namibia NC Present 2 DG [25] 
Khoesan Namibia KS Present 22 HO [22] 
Mbuti DR Congo NC, NS Present 10/4 HO/DG [22, 25] 
Mende Sierra Leone NC Present 8/2 HO/DG [22, 25] 
Mursi Ethiopia NS Present 2 DG [20] 
Sandawe Tanzania KS Present 22 HO [22] 
Somali Kenya AA Present 13 HO [22] 
Yoruba Nigeria NC Present 70/3 HO/DG [22, 25] 


List of populations in the analyses in this study. Data types are in-solution targeted SNP capture (1240k); whole-genome sequences with pseudohaploid genotype calls (SG); high-coverage 
whole-genome sequences with diploid genotype calls (DG); and Human Origins SNP array (HO). For some populations, we used different sets of samples for different analyses (indicated by 
forward slashes); individuals genotyped on the Human Origins array were used for PCA and for f-statistics testing differential relatedness to Shum Laka individuals (Fig. 3b, Extended Data 
Fig. 3b). For the Hadza, we used five individuals with Human Origins data for PCA and two of these five individuals for admixture graph modelling. Data for the Altai Neanderthal individual are 


from ref. ’°. AA, Afroasiatic; IE, Indo-European; KS, Khoesan; NC, Niger-Congo; NS, Nilo-Saharan; ST, Sino-Tibetan. 
*Individuals from Hora, Chencherere and Fingira. 
tIndividuals from Ballito Bay (A and B) and St Helena Bay. 
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Extended Data Table 2 | Allele-sharing statistics for deep ancestry 


fa(X, Mursi; SA, Han) | /4(X, Mota; SA, Han) | /,(X, Han; SA, Mursi) f4(X, Mota; SA, Mursi) 
Test pop Value Z-score Value Z-score Value Z-score Value Z-score 
Dinka 1.4 5.8 -2.0 -5.5 0.1 0.2 -6.3 -20.2 
Mota 3.4 9.0 0 0 6.3 18.1 0 0 
Hadza 41 10.3 0.8 1.7 7.3 21.2 0 2.7 
Yoruba 4.7 17.8 Leg, 3.8 5.2 18.2 -1.1 -3.5 
Lemande 5.0 16.8 1.7 4.5 5.7 18.2 -0.6 -2.1 
Mende 5.7 19.1 2.3 6.3 6.3 20.0 0 0 
Shum Laka 11.7 38.7 8.3 22.6 12.7 40.8 6.4 20.5 
Aka 13.3 39.1 9.9 25.2 13.6 40.4 7.3 22.0 
Mbuti 16.4 50.4 13.0 34.9 16.4 49.9 10.0 31.8 
Mursi 0 0 -3.4 -9.0 Fa vi i re 
Agaw 0.1 0.3 -6.2 -18.9 

f4(X, Mursi; SA, Mota) | /,(X, Han; SA, Mota) f4(X, Han; SA, Yor) fs(X, Mursi; Chimp, Yor) 
Test pop Value Z-score Value Z-score Value Z-score Value Z-score 
Dinka 0.8 3.3 3.7 11.9 -0.7 -2.8 -0.9 4.7 
Mota ~ 8 uy iy 57 18.1 5:2 17.7 
Hadza 41 11.5 7.0 17.7 4.8 15.2 3.4 11.4 
Yoruba 4.1 15.7 7.1 21.6 
Lemande 41 14.5 7.1 21.0 
Mende 4.8 17.3 7.8 22.5 es H $e 
Shum Laka 941 29.8 12.0 33.7 8.0 28.7 8.3 31.9 
Aka 10.3 33.4 13.2 35.5 7.8 24.8 8.5 30.1 
Mbuti 12.5 41.8 15.5 44.1 11.6 40.8 11.8 46.3 
Mursi 0 0 3.0 8.8 0.6 2.2 0 0 
Agaw -2.4 -7.7 0.6 1.8 0 0.2 -0.2 -0.9 
SA 20.3 66.0 


Variations of allele-sharing statistics (multiplied by 1,000; computed on 1,121,119 SNPs) sensitive to ancestry in the test population X from a deeply-splitting lineage, along with Z-scores for 
difference from zero. The zero level has a different meaning depending on which population is in the second position in the statistic. Blank entries are statistics that are confounded by specific 
relationships between the test population and one of the reference populations (in the third or fourth position; either duplication of the same group, Agaw with Han owing to ancestry related 
to non-Africans, or Yoruba with other West Africans). From the statistics f,(Mursi or Agaw, Han; ancient South African hunter-gatherers, Yoruba), we find minimal differences in deep ancestry 


proportions among Han, Mursi, and Agaw; from f,(X, Mursi; chimpanzee, Yoruba), we obtain a value for ancient South African hunter-gatherers that is roughly twice as large as that for Central 


African hunter-gatherers. SA, ancient South African hunter-gatherers; Yor, Yoruba. 


Extended Data Table 3 | Admixture graph parameter estimates 


Model version: 123 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 

Mixture proportions (%) 
Shum Laka 64 66 62 71 64 58 63 61 63 61 64 64 64 64 63 63 63 .. 64 61 69 63 67/ 
basal WA 62* 
Aka Bantu- 59 59 57 63 59 56 58 57 59 58 59 59 59 59 59 59 58 58 59 58 62 61 59 
associated 
Mbuti Bantu- 26 24 33 19 28 27 26 12 28 30 32 25 24 26 29 28 35 35 25 35 23 36 27 
associated 
Mbuti East African- 17 19 10 27 14 9 16 23 15 13 11 19 20 1813 14 6 6 18 9 23 8 16 
related 
West Africanclade 2 2 4 43 3 3 22 2 3 22 2 3 33 2«.@«.~«.~«. 2 
archaic 
West Africanclade 10 9 17 8 12 29 15 24111819 9 8 9 14132929 . . . 2. 11 
deep modern human 
Mende deep 4443 43 465 55 44 45 55 544 4 3 4 
ancestry 
Mota deep 29 29 30 29 30 31 31 30 29 31 29 29 29 28 30 31 29 29 29 30 27 26 29 
ancestry 

Branch lengths 

Basal WA 2333 3 13 22 22 23 3223«.@ 23 3 1 3 
split’ 
South AfricanHG 11 04 1 -1 41 21414 «1 «4 «4 4 «4 «4°14 1 1 04 0 1 
split* 
Ghost modern 1131231370217 0-771 41 04% «4 4 2. 2 2. 2 2 
human split* 


Key admixture graph parameter estimates across different model versions. Supplementary Information section 3 provides the full details. 1,Primary model; 2, no ‘dummy’ admixture; 3, African- 
ascertained SNPs; 4, transversion SNPs; 5, whole-genome sequence data for the Shum Laka individuals; 6, outgroup-ascertained transversions; 7, Hadza added; 8, Mbo in place of Lemande; 9, 
Herero added; 10, Chewa added; 11, Mursi in place of Agaw; 12, Baka added; 13, Bakola added; 14, Bedzan added; 15, Mada added; 16, Fulani added; 17, Taforalt added; 18, alternative admixture 
for the Shum Laka individuals; 19, alternative deep source; 20, alternative deep source with African-ascertained SNPs; 21, alternative deep source with transversion SNPs; 22, alternative deep 
source with outgroup-ascertained transversions; 23, pairs of Shum Laka individuals fit separately. WA, West African. 

*Earlier pair/later pair. 

tUnits above the main West African clade. 

#Units below the split of the Central African hunter-gather lineage (negative value indicates distance above). 

#Units along the Central African hunter-gather lineage (negative values indicate distances along an adjacent edge). 
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Study description 


Research sample 


Sampling strategy 


Data collection 


Timing and spatial scale 


Data exclusions 
Reproducibility 
Randomization 


Blinding 


Genetic analyses were performed on DNA data generated from ancient human skeletons. Population genetic statistics, primarily 
testing historical relationships by measuring allele-sharing patterns across populations, were computed using genome-wide SNP 
genotypes. 


Four ancient human individuals buried at Shum Laka in Cameroon; 63 newly reported present-day individuals from five populations 
in Cameroon (for comparative analyses); ancient African individuals from Gallego Llorente et al. 2015, Schlebusch et al. 2017, and 
Skoglund et al. 2017; present-day African individuals from Mallick et al. 2016 and Fan et al. 2019. 

We sampled available bones from six Shum Laka individuals and obtained working data from four. We targeted approximately 1.2 
million genome-wide SNPs, which effectively cover almost all independent loci (due to linkage disequilibrium) and provide good 


power in population history analyses (all remains in this study yielding good coverage). 


DNA from the ancient remains was extracted, sequenced, and processed into SNP genotype calls (final data overseen by D.R. and 
S.Mal.). 


Ancient individuals were sampled from the Shum Laka rockshelter in western Cameroon. Newly reported present-day individuals 
were from western and southwestern Cameroon. 


Two of the sampled skeletons did not yield working data as assessed by pre-established ancient DNA quality criteria. 


All attempts to reproduce were successful. 


The only subgroup assignments were by date for the ancient individuals and by ethno-linguistic group for the present-day individuals. 


Analyses were performed either for all four individuals separately, all four grouped together, or for the two temporally separated 


pairs; other sample-specific features were not relevant to results. 
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Policy information about studies involving human research participants 


Population characteristics Samples were collected at market places or other locations of a similar nature where many individuals of the ethnic group of 
interest were expected to reside or assemble. Samples were only collected from males who declared themselves to be 18 years 
of age or older and not the sibling of another donor. The self-declared ethnic identity of the donor was recorded after the 


sample was taken, but samples were collected without discrimination on the basis of ethnic identity. There was no genotypic 
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data on donors available to collectors; subsequently, for analysis, we confirmed that none of the individuals were substantial 
genetic outliers relative to their ethno-linguistic groups. 


Recruitment Participants were recruited by public announcement at the sampling location with samples taken in the order donors presented 
themselves. No monetary reward was offered. 
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Since its introduction, the reward prediction error theory of dopamine has explained 

a wealth of empirical phenomena, providing a unifying framework for understanding 
the representation of reward and value in the brain’ *. According to the now canonical 
theory, reward predictions are represented as a single scalar quantity, which supports 
learning about the expectation, or mean, of stochastic outcomes. Here we propose an 


account of dopamine-based reinforcement learning inspired by recent artificial 
intelligence research on distributional reinforcement learning*°. We hypothesized 
that the brain represents possible future rewards not as a single mean, but instead asa 
probability distribution, effectively representing multiple future outcomes 
simultaneously and in parallel. This idea implies a set of empirical predictions, which 
we tested using single-unit recordings from mouse ventral tegmental area. Our 
findings provide strong evidence for a neural realization of distributional 
reinforcement learning. 


The reward prediction error (RPE) theory of dopamine derives from 
work in the artificial intelligence (Al) field of reinforcement learning 
(RL)’. Since the link to neuroscience was first made, however, RL has 
made substantial advances®”’, revealing factors that greatly enhance 
the effectiveness of RL algorithms”. In some cases, the relevant mecha- 
nisms invite comparison with neural function, suggesting hypotheses 
concerning reward-based learning in the brain” ”. Here we examine a 
promising recent development in Al research and investigate its poten- 
tial neural correlates. Specifically, we consider a computational frame- 
work referred to as distributional reinforcement learning* ® (Fig. 1a, b). 

Similar to the traditional form of temporal-difference RL—on which 
the dopamine theory was based—distributional RL assumes that 
reward-based learning is driven by aRPE, which signals the difference 
between received and anticipated reward. (For simplicity, we introduce 
the theory in terms of a single-step transition model, but the same 
principles hold for the general multi-step (discounted return) case; 
see Supplementary Information.) The key difference in distributional 
RL lies in how ‘anticipated reward’ is defined. In traditional RL, the 
reward prediction is represented as a single quantity: the average over 
all potential reward outcomes, weighted by their respective probabili- 
ties. By contrast, distributional RL uses a multiplicity of predictions. 
These predictions vary in their degree of optimism about upcoming 
reward. More optimistic predictions anticipate obtaining greater 
future rewards; less optimistic predictions anticipate more meager 
outcomes. Together, the entire range of predictions captures the full 
probability distribution over future rewards (more details in Supple- 
mentary Information). 

Compared with traditional RL procedures, distributional RL can 
increase performance in deep learning systems by a factor of two 
or more>, an effect that stems in part from an enhancement of 


representation learning (see Extended Data Figs. 2, 3 and Supplemen- 
tary Information). This prompts the question of whether RL inthe brain 
might leverage the benefits of distributional coding. This question is 
encouraged both by the fact that the brain utilizes distributional codesin 
numerous other domains’, and by the fact that the mechanism of distri- 
butional RL is biologically plausible*”. Here we tested several predictions 
of distributional RL using single-unit recordings inthe ventral tegmental 
area (VTA) of mice performing tasks with probabilistic rewards. 


Value predictions vary among dopamine neurons 


Incontrastto classical temporal-difference (TD) learning, distributional 
RL posits a diverse set of RPE channels, each of which carries a different 
value prediction, with varying degrees of optimism across channels. 
(Value is formally defined in RL as the mean of future outcomes, but here 
we relax this definition to include predictions about future outcomes 
that are not necessarily the mean.) These value predictions in turn pro- 
vide the reference points for different RPE signals, causing the latter to 
also differ in terms of optimism. As a surprising consequence, a single 
reward outcome can simultaneously elicit positive RPEs (within relatively 
pessimistic channels) and negative RPEs (within more optimistic ones). 

This translates immediately into a neuroscientific prediction, which 
is that dopamine neurons should display such diversity in ‘optimism’. 
Suppose an agent has learned that a cue predicts a reward whose mag- 
nitude will be drawn from a probability distribution. In the standard 
RL theory, receiving a reward with magnitude below the mean of this 
distribution will elicit a negative RPE, whereas larger magnitudes 
will elicit positive RPEs. The reversal point—the magnitude at which 
prediction errors transition from negative to positive—in standard 
RL is the expectation of the magnitude’s distribution. By contrast, in 
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Fig. 1| Distributional value coding arises from a diversity of relative scaling 
of positive and negative prediction errors. a, Inthe standard temporal- 
difference (TD) theory of the dopamine system, all value predictors learn the 
same value V. Each dopamine cell is assumed to have the same relative scaling 
for positive and negative RPEs (left). This causes each value prediction (or value 
baseline) to be the mean of the outcome distribution (middle). Dotted lines 
indicate zero RPE or pre-stimulus firing. b, Inour proposed model, 
distributional TD, different channels have different relative scaling for positive 


distributional RL, the reversal point differs across dopamine neurons 
according to their degree of optimism. 

Wetested for such reversal-point diversity in optogenetically verified 
dopaminergic VTA neurons, focusing on responses to receipt of liquid 
rewards, the volume of which was drawn at random on each trial from 
seven possible values (Fig. 1c). As anticipated by distributional RL, but not 
by the standard theory, we found that dopamine neurons had substan- 
tially different reversal points, ranging from cells that reversed between 
the smallest two rewards to cells that reversed between the largest two 
rewards (Fig. 2a, b). This diversity was not owing to noise, as the reversal 
point estimated on a random half of the data was a robust predictor 
of the reversal point estimated on the other half of the data (R = 0.58, 
P=1.8 x 10° by linear regression; Fig. 2c). In fact, in response to the 5 pl 
reward, 13 out of 40 cells had significantly above-baseline responses and 
10 out of 40 cells had significantly below-baseline responses. Note that 
while some cells appeared pessimistic and others appeared optimistic, 
there was alsoa population of cells with approximately neutral responses, 
as predicted by the distributional RL model (compare with Fig. 2a, right). 

A stronger test of our theory is whether this diversity also exists 
within a single animal. Most animals had too few cells for analysis, but 
within the single animal with the highest number of recorded cells, 
reversal points estimated on half of the data were robustly predictive 
of reversal points estimated on the other half (P=0.008). Furthermore, 
in response to a single reward magnitude (5 pl), 6 out of 16 cells had 
significantly above-baseline responses and 5 out of 16 cells had sig- 
nificantly below-baseline responses. Finally, Fig. 2d shows rasters of 
two example cells from this animal, exhibiting consistently opposite 
responses to the same reward. 

Because the diversity we observe is reliable across trials, it cannot 
be explained by adding measurement noise to non-distributional TD 
models. As detailed in section 2 of the Supplementary Information (see 
also Extended Data Fig. 4), we also analysed several more elaborate 
alternative models, and whereas some of these can give rise to the 
appearance of reversal-point diversity under some analysis methods, 
the same models are contradicted by other aspects of the experimental 
data, which we report below. 

Our first prediction dealt with the relationship between dopaminergic 
signalling and reward magnitude; dopaminergic RPE signals also scale 
with reward probability”, and distributional RL also leads toa predic- 
tioninthis domain. Pursuing this, we analysed data froma second task 
in which sensory cues indicated the probability of an upcoming liquid 
reward (Fig. 1d). One cue indicated a 10% probability of reward, a differ- 
ent cue indicated a 50% probability, and a third a 90% probability. The 
standard RPE theory predicts that, considering responses at the time the 
cue is presented, all dopamine neurons should have the same relative 
spacing between 10%, 50% and 90% cue responses. (Under neutral risk 
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(a*) and negative (a) RPEs. Red shading indicates a’ > a, and blue shading 
indicates a >a*. Animbalance between a’ anda causes eachchanneltolearna 
different value prediction. This set of value predictions collectively represents 
the distribution over possible rewards. c, We analyse data from two tasks. Inthe 
variable-magnitude task, there is a single cue, followed by areward of 
unpredictable magnitude. d, Inthe variable-probability task, there are three 
cues, which each signal a different probability of reward, andthe reward 
magnitude is fixed. 


preferences, the 50% cue response should be midway between the 10% 
and 90% cues. Under different risk preferences, the 50% cue response 
might be at a different position between 10% and 90%, but it should 
be the same for all neurons). Distributional RL predicts, instead, that 
dopamine neurons should vary in their responses to the 50% cue: some 
neurons should respond optimistically, emitting a RPE nearly as large as 
to the 90% cue. Others should respond pessimistically, emitting a RPE 
closer to the 10% cue response (Fig. 3a). Labelling these two cases as 
optimistically and pessimistically biased, respectively, distributional RL 
predicts that as a population, dopamine neurons should show concur- 
rent optimistic and pessimistic coding for reward probability. 

To test this prediction, we analysed responses of dopaminergic VTA 
neuronsinthe cued probability task just described (see Methods for more 
details). As predicted by distributional RL, but not by the standard theory, 
dopamine neurons differed in their patterns of response across the three 
reward-probability cues, with both optimistic and pessimistic probability 
coding observed (Fig. 3b left, Extended Data Figs. 6, 7). Again, this diver- 
sity was not due to noise, as 10 out of 31 cells were significantly optimistic 
and 9 out of 31 cells were significantly pessimistic, ata P< 0.05 threshold 
(see Methods). By comparison, at a 0.05 threshold, approximately 3 out 
of 31 cells in anon-distributional TD system are expected by chance to 
appear either significantly optimistic or pessimistic. Atthe group level, 
the null hypothesis of no diversity was rejected by one-way analysis of 
variance (ANOVA) (F(30, 3335) = 4.31, P= 6 x 10“). Notably, both forms 
of probability coding were observed side by side in individual animals. 
Inthe animal with the largest number of recorded cells, 4 out of 17 cells 
were consistently optimistic and 5 out of 17 cells were consistently pessi- 
mistic. This was also significant by ANOVA (F(15, 1652) =4.02, P=3 x10”). 

Because most cells were recorded in different sessions, it was 
important to examine whether global changes in reward expecta- 
tions between sessions might explain the observed diversity in opti- 
mism. To this end, we analysed patterns of anticipatory licking. Here 
we found that, although within-session fluctuations in licking were 
predictive of within-session fluctuations in dopamine cell firing, there 
was no relationship between optimism and licking ona cell-by-cell 
basis (Extended Data Fig. 9). This observation makes it unlikely that 
the diverse responses we observed in dopamine neurons are explained 
by session-to-session variability in global reward expectation. That 
interpretation is further undermined by the fact that reversal-point 
diversity was observed inthe one case where several cells were recorded 
simultaneously in one animal (Fig. 3c and Supplementary Information). 


GABAergic neurons make diverse reward predictions 


In distributional RL, diversity in RPE signalling arises because differ- 
ent RPE channels listen to different reward predictions, which vary 
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Fig. 2| Different dopamine neurons consistently reverse from positive to 
negative responses at different reward magnitudes. Variable-magnitude 
task from ref. °°. On each trial, the animal experiences one of seven possible 
reward magnitudes (0.1, 0.3, 1.2,2.5,5,10 or 20 pl), selected randomly. a, RPEs 
produced by classical and distributional TD simulations. Each horizontal bar is 
one simulated neuron. Each dot colour corresponds toa particular reward 
magnitude. Thex axis is the cell’s response (change in firing rate) when reward 
is delivered. Cells are sorted by reversal point. In classical TD, all cells carried 
approximately the same RPE signal. Note that the slight differences between 
cells arose from Gaussian noise added to the simulation; the differences 
between cells inthe classical TD simulation were not statistically reliable. 
Conversely, in distributional TD, cells had reliably different degrees of 
optimism. Some responded positively to almost all rewards, and others 


in their degree of optimism. From a neuroscientific perspective, it 
should thus be possible to track the effects we have identified at the 
level of VTA dopamine neurons back to upstream neurons signal- 
ling reward predictions. Previous work strongly suggests that VTA 
GABAergic (y-aminobutyric acid) neurons have precisely this role, 
and that the reward prediction used to compute the RPE is reflected 
in their firing rates”. Therefore, we predicted that, in the same task 
described above, the population of VTA GABAergic neurons should 
also contain concurrent optimistic and pessimistic probability cod- 
ing. As predicted, consistent differences in probability coding were 
observed across putative GABAergic neurons, again with concurrent 
optimism and pessimism (Fig. 3b, right). In the animal with the largest 
number of cells recorded, 12 out of 36 cells were consistently opti- 
mistic and 11 out of 36 cells were consistently pessimistic (example 
cells shown in Fig. 3d). 


A Firing rate (normalized) 


Reversal point in half of data Time from reward onset (ms) 


responded positively to only the very largest reward. b, Responses recorded 
from light-identified dopamine neurons in behaving mice. Neurons differed 
markedly in their reversal points. c, To assess whether this diversity was 
reliable, we randomly partitioned the data into two halves and estimated 
reversal points independently in each half. We found that the reversal point 
estimated in one half was correlated with that estimated in the other half 
(P=1.8 x 10> by linear regression). d, Spike rasters for two example dopamine 
neurons from the same animal, showing responses to all trials when the 5 pl 
reward was delivered. We analysed data from 200 to 600 ms after reward onset 
(highlighted), to exclude the initial transient that was positive for all 
magnitudes. During this epoch, the cell onthe bottom fires above its baseline 
rate, while the cell onthe top pauses. 


Distribution coding from asymmetric RPE scaling 
The results reported in the preceding sections suggest that a distri- 
bution of value predictions is coded in the neural circuits underlying 
RL. How might such coding arise in the first place? Recent Al work on 
distributional RL® has shown that distributional coding arises automati- 
cally ifa single change is made to the classical TD learning mechanism. 
In classical TD, positive and negative errors are given equal weight. 
As aresult, positive and negative errors are in equilibrium when the 
learned prediction equals the mean of the reward distribution. There- 
fore, classical TD learns to predict the average over future rewards. 
Bycontrast, in distributional TD, different RPE channels place different 
relative weights on positive versus negative RPEs (see Fig. 1b). In channels 
that overweight positive RPEs, reaching equilibrium requires these positive 
errors to become less frequent, so the learning dynamics converge ona 
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Fig. 3 | Optimistic and pessimistic probability coding occur concurrently in 
dopamine and VTA GABAergic neurons. Data from variable-probability task. 
a, Histogram (across simulated cells) of t-statistics which compare each cell’s 
50% cue response against the mean 50% cue response across cells. Qualitatively 
identical results hold when comparing the 50% cue response against 

the midpoint of 10% and 90% responses. The superimposed black curve shows 
the t-distribution with the corresponding degrees of freedom. Distributional 
TD predicts simultaneous optimistic and pessimistic coding of probability, 
whereas classical TD predicts that all cells have the same coding. Colour 
indicates the degree of optimism or pessimism. b, Same asa, but using data 


from real dopamine and putative GABAergic neurons. The pattern of results 
closely matches the predictions from the distributional TD model. 

c, Responses of four example dopamine neurons recorded simultaneously ina 
single animal. Each trace is the average response to one of the three cues. 
Shaded area showss.e.m. Time zero is the onset of the odour cue. Some cells 
code the 50% cue similarly to the 90% cue, while others simultaneously code it 
similarly to the 10% cue. Grey areas show epoch averaged for summary 
analyses. d, Responses of two example VTA GABAergic cells from the same 
animal. 
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Fig. 4 | Relative scaling of positive and negative dopamine responses 
predicts reversal point. a, Three simulated dopamine neurons—each witha 
different asymmetry—in the variable-magnitude task. For each unit, we 
empirically estimated the reversal point where responses switch from negative 
to positive. The x axis shows reward minus the per-cell reversal point, 
effectively aligning each cell’s responses to its respective reversal point. 
Baseline-subtracted response to reward is plotted on the y axis. Responses 
below the reversal point are shown in green and those above are shownin 
orange. Solid curves show linear functions fit separately to the above-reversal 
and below-reversal domains of each cell. b, Same asa, but showing three real 
example dopamine cells. c, The diversity in relative scaling of positive and 
negative responses in dopamine cells is statistically reliable (one-way ANOVA; 
F(38, 234) =2.93, P=4 x10”). The mean and 95% confidence intervals of 


more optimistic reward prediction. Conversely, in channels overweighting 
negative RPEs, amore pessimistic prediction is needed to attain equilibrium 
(Fig. 4a, Extended Data Fig. 1a). Together, the set of predictions learned 
across all channels encodes the full shape of the reward distribution. 

When distributional RL is considered as a model of the dopamine sys- 
tem, these points translate into two testable predictions. First, dopamine 
neurons should differ in their relative scaling of positive and negative 
RPEs. To test this prediction, we analysed activity from VTA dopamine 
neurons in the variable-magnitude task described above. We first esti- 
mated a reversal point for each cell as previously described. Then, foreach 
cell, we separately estimated two slopes: a’ for responses in the positive 
domain (that is, above the reversal point), and a@ forthenegative domain 
(Fig. 4b). This revealed reproducible differences across dopamine neurons 
inthe relative magnitude of positive versus negative RPEs (Extended 
Data Fig. 5). Across all animals, the mean value of the ratio a’/(a’* +a) 
was 0.48. However, many cells had a value significantly above or below 
this mean (Fig. 4c; see Methods for details of statistical test). Atthe group 
level, there was significant diversity between cells by one-way ANOVA 
(F(38, 234) = 2.93, P=4 x10”). In the animal with the largest number of 
recorded cells, 3 out of 15 cells were significantly below the mean and 3 
out of 15 were significantly above the mean; ANOVA again rejected the 
null hypothesis of no diversity between cells (F(14, 90) =4.06, P=2x10°). 

Second, RPE asymmetry should correlate, across dopamine neu- 
rons, with reversal point. Dopamine neurons that scale positive RPEs 
more steeply relative to negative RPEs should be linked with relatively 
optimistic reward predictions, and so should have reversal points at 
relatively high reward magnitudes. Dopamine neurons that scale posi- 
tive RPEs less steeply should have relatively low reversal points. Again 
using data from the variable-magnitude task, we found a strong cor- 
relation between RPE asymmetry and reversal point (P= 8.1 x 10° by 
linear regression; Fig. 4d, e), validating this prediction. Furthermore, 
this effect survived when only considering data from the single animal 
with the largest number of recorded cells (P= 0.002). 


Decoding reward distributions 

As we have discussed, the distributional TD model correctly predicts 
that dopamine neurons should show diverse reversal points and 
response asymmetries, and that these should correlate. Finally, we 
consider the most detailed prediction of the model. The specific rever- 
sal points observed in any experimental situation, together with the 
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a’/(a’ +a) are displayed, where a’ anda@ are the slopes estimated above. 

d, Relative scaling of positive and negative responses predicts that cell’s 
reversal point (P=8.1x 10° by linear regression). Each point represents one 
dopamine cell. Dashed line is the mean over cells. Light grey traces show 
reversal points measured in distributional TD simulations of the same task, and 
show variability over simulation runs. e, All40 dopamine cells plotted inthe 
same fashion as in b, except normalized by the slope estimated in the negative 
domain. Thus, the observed variability in slope in the positive domain 
corresponds to diversity in relative scaling of positive and negative responses. 
Cells are coloured by reversal point, to illustrate the relationship between 
reversal point and asymmetric scaling. In all panels, reward magnitudes are in 
estimated utility space (see Methods). 


particular response asymmetries in the corresponding neurons, should 
encode an approximate representation of the anticipated probability 
distribution over future rewards. 

If this is the case, then with sufficient data it should be possible to 
decode the full value distribution from the responses of dopamine neu- 
rons. As a final test of the distributional RL hypothesis, we attempted 
this type of decoding. The distributional TD model implies that, if 
dopaminergic responses are approximately linear in the positive and 
negative domains, then the resultant learned reward predictions will 
correspond to expectiles of the reward distribution” (expectiles are 
astatistic of distributions, which generalize the mean in the same way 
that quantiles generalize the median). 

We therefore treated the reversal points and response asymmetries 
measured in the variable-magnitude task as defining a set of expec- 
tiles, and we transformed these expectiles into a probability density 
(see Methods). As shown in Fig. 5a—c, the resulting density captured 
multiple modes of the ground-truth value distribution. Decoding the 
RPEs produced by a distributional TD simulation, but not a classical 
TD simulation, produced the same pattern of results. 

Parallel analyses focusing on the variable-probability task (see Meth- 
ods) yielded similarly good matches to the ground-truth distributions 
in that task (Fig. 5d, e). In both tasks, successful decoding depended 
onthe specific pattern of variability in the neural data, and not on the 
presence of variability per se (Extended Data Fig. 8). 

Itis worth emphasizing that none of the effects we have reported are 
anticipated by the standard RPE theory of dopamine, which implies that 
all dopamine neurons should transmit essentially the same RPE signal. 
Why have the present effects not been observed before? In some cases, 
relevant data have been hiding in plain sight. For example, anumber of 
studies have reported marked variability in the relative magnitude of 
positive and negative RPEs across dopamine neurons; however, they 
have treated this as an incidental finding or a reflection of measure- 
ment error, or viewed it as a problem for the RPE theory”. One of the 
earliest studies of reward-probability coding in dopaminergic RPEs 
remarked on apparent diversity across dopamine neurons, but onlyina 
footnote’. A more general issue is that the forms of variability we have 
reported are masked by traditional analysis techniques, which typically 
focus on average responses across dopamine neurons (see Supplemen- 
tary Information and Extended Data Fig. 10). 

Distributional RL offers a range of untested predictions. Dopamine 
neurons should maintain their ordering of relative optimism across task 
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Fig. 5| Decoding reward distributions from neural responses. 

a, Distributional TD simulation trained on the variable-magnitude task, whose 
actual (smoothed) distribution of rewards is shown in grey. After training the 
model, we interpret the learned values as a set of expectiles. We then decode 
the set of expectiles into a probability density (blue traces). Multiple solutions 
are shown in light blue, and the average across solutions is shown in dark blue. 
(See Methods for more details.) b, Same asa, but witha classical TD simulation. 
c, Same asa, but using data from recorded dopamine cells. The expectiles are 
defined by the reversal points and the relative scaling from the slopes of 


contexts, even as the specific distribution of rewards changes. If RPE 
channels with particular levels of optimism are selectively activated 
with optogenetics, this should sculpt the learned distribution, which 
should in turn be detectable with behavioural measures of sensitivity 
to moments of the distribution. We list further predictions in the Sup- 
plementary Information. 

Distributional RL also gives rise to a number of broader questions. 
What are the circuit- or cellular-level mechanisms that give rise toa 
diversity of asymmetry in positive versus negative RPE scaling? It is 
also worth considering whether other mechanisms, aside from asym- 
metric scaling of RPEs, might contribute to distributional coding. It is 
well established, for example, that positive and negative RPEs differ- 
entially engage striatal D, and D, dopamine receptors”, and that the 
balance of these receptors varies anatomically”. This suggests a sec- 
ond potential mechanism for differential learning from positive versus 
negative RPEs”. Moreover, how do different RPE channels anatomically 
couple with their corresponding reward predictions (see Extended 
Data Fig. 4i-k)? Finally, what effects might distributional coding have 
downstream, at the level of action learning and selection? With this 
question in mind, it is notable that some current theories in behavioural 
economics centre on risk measures that can be easily read out from 
the kind of distributional codes that the present work has considered. 

Finally, we speculate on theimplications of the distributional hypothesis 
of dopamine forthe mechanisms of mental disorders suchas addiction and 
depression. Mood has been linked with predictions of future reward”, and 
ithas been proposed that both depression and bipolar disorder may involve 
biased forecasts concerning value-laden outcomes”. It has recently been 
proposed that such biases may arise from asymmetries in RPE coding’. 
There are clear potential connections between these ideas andthe phenom- 
enawe have reported here, presenting opportunities for further research. 
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Methods 


Distributional RL model 

The model for distributional RL we use throughout the work is based 
onthe principle of asymmetric regression and extends recent results 
in AP*S, We present a more detailed and accessible introduction to 
distributional RL in the Supplementary Information. Here we outline 
the method in brief. 

Let f: IR > Rbearesponse function. In each observed state x, let there 
beaset of value predictions V(x) which are updated with learning rates 
aj, a; € R*. Then given astate x, next-state x’, resulting reward signal r 
and time discount y € [0, 1), the distributional TD model computes 
distributional TD errors 


§=1+ yV(x’) - Yo) (1) 


where V,(x’) is a sample from the distribution V(x’). The model then 
updates the baselines with 


V(x) < Vix) + aff(6) for 6,>0 (2) 


V(x) < V(x) + af (6,) for 6; < 0 (3) 


When performed with a tabular representation, asymmetry uni- 
formly distributed, and f(6) = sgn(6), this method converges to the T; 
quantile, 7,= ae of the distribution over discounted returns at x 
(ref. °). Similarly, asymmetric regression with response function f(6) =6 
corresponds to expectile regression”. Like quantiles, expectiles fully 
characterize the distribution and have been shown to be particularly 
useful for measures of risk”. 

Finally, we note that throughout the paper, we use the terms opti- 
mistic and pessimistic to refer to return predictions that are above or 
below the mean (expected) return. Importantly, these predictions are 
optimistic inthe sense of corresponding to particularly good outcomes 
from the set of possible outcomes. They are not optimistic inthe sense 
of corresponding to outcomes that are impossibly good. 


Artificial agent results 

Atari results are on the Atari-57 benchmark using the publicly avail- 
able Arcade Learning Environment. This is a set of 57 Atari 2600 
games and human-performance baselines. Refer to previous work 
for details on deep Q-networks (DON) and computation of human- 
normalized scores®. The distributional TD agent uses our proposed 
model and a DON with multiple (n = 200) value predictors, each with 
a different asymmetry, spaced uniformly in [0, 1]. The training objec- 
tive of DON, the Huber loss, is replaced with the asymmetric quantile- 
Huber loss, which corresponds to the x-saturating response function 
f(6) =max(min(6, x), -K), with =1. 

Finally, at each update we train all channels based onthe immediate 
reward and the predicted future returns from all next-state value predic- 
tors. Further details can be found in ref. °. The physics-based motor- 
control task requires control of a 28 degrees-of-freedom humanoid 
tocompletea 3D obstacle course in minimal time”. Full details for the 
D3PG and distributional D3PG agents are as described™. Distributions 
over return shown in Extended Data Fig. 2d, fare based on the network- 
predicted distribution in each of the given frames. 


Tabular simulations 

Tabular simulations of the classical TD and distributional TD models 
used a population of learning rates selected uniformly at random, 
aj ~ U(O, 1) for each cell i. In all cases the only algorithmic difference 
between the classical and distributional TD models was that the dis- 
tributional model used a separately varying learning rate for negative 
prediction errors, a; ~ U(O, 1)for each cell i. Both methods used a linear 
response function. Qualitatively similar results were also obtained 


with other response functions (for example, Hill function” or x-satu- 
rating), despite these leading to semantically different estimators of 
the distribution. The population sizes were chosen for clarity of pres- 
entation and to provide similar variability as observed in the neuronal 
data. Each cell was paired with a different state-dependent value esti- 
mate V(x). Note that while these simulations focused on immediate 
rewards, the same algorithm also learns distributions over multi-step 
returns. 

In the variable-probability task, each cue corresponded to a differ- 
ent value estimate and reward probability (90%, 50% or 10%). When 
rewarded, the agent received numerical reward of 1.0, and when omit- 
ted, it received 0.0. Both agents were trained for 100 trials of 5,000 
updates, and both simulated n=31 cells (separate value estimates). The 
learning rates were all selected uniformly at random between [0.001, 
0.2]. Cue response was taken to be the temporal difference froma con- 
stant zero baseline to the value estimate. 

Inthe variable-magnitude task, all rewards were taken to be the water 
magnitude measured in microlitres (qualitatively same results obtained 
with utilities instead of magnitudes). For Fig. 2we ran 10 trials of 25,000 
updates each for 150 estimators with random learning rates in [0.001, 
0.02]. These smaller learning rates and larger number of updates were 
intended to ensure the values converged fully with low error. We then 
report temporal difference errors for ten cells taken uniformly to span 
the range of value estimates for each agent. Reported errors (simulating 
change in firing rate) are the utility of a reward minus the value estimate 
and scaled by the learning rate. As with the neuronal data, these are 
reported averaged over trials and normalized by variance over reward 
magnitudes. Distributional TD RPEs are computed using asymmetric 
learning rates, with a small constant (floor) added to the learning rates. 


Distribution decoding 
For bothreal neural data and TD simulations, we performed distribution 
decoding. The distributional and classical TD simulations used for 
decoding inthe variable-magnitude task each used 40 value predictors, 
to match the 40 recorded cells in the neural data (neural analyses were 
pooled across the six animals). In the distributional TD simulation, each 
value predictor used a different asymmetric scaling factor 7, = re 
and therefore learned a different value prediction V,. a 
The decoding analyses began with a set of reversal points, V;, and 
asymmetric scaling factors T;. For the neural data, these were obtained 
as described elsewhere. For the simulations, they were read directly 
from the simulation. These numbers were interpreted as a set of expec- 
tiles, with the 7,-th expectile having value V;. We decoded these into 
probability densities by solving an optimization problem to find the 
density most compatible with the set of expectiles”°. For optimization, 
the density was parameterized as a set of samples. For display in Fig. 5, 
the samples are smoothed with kernel density estimation. 


Animals and behavioural tasks 

The rodent data we re-analysed here were first reported in ref.°. Meth- 
ods details can be found in that paper and in ref. °°. We give a brief 
description of the methods below. 

Five mice were trained on a ‘variable-probability’ task, and six differ- 
ent mice ona ‘variable-magnitude'’ task. In the variable-probability task, 
in each trial the animal first experienced one of four odour cues for 1s, 
followed by a1-s pause, followed by a reward (3.75 pl water), an aversive 
airpuff or nothing. Odour 1 signalled a 90% chance of reward, odour 
2 signalled a 50% chance of reward, odour 3 signalled a 10% chance of 
reward and odour 4 signalled a 90% chance of airpuff. Odour meanings 
were randomized across animals. Inter-trial intervals were exponentially 
distributed. 

Aninfrared beam was positioned in front of the water delivery spout, 
and each beam break was recorded as one lick event. We report the aver- 
age lick rate over the entire interval between the cue and the outcome 
(that is, O-2,000 ms after cue onset). 


In the variable-magnitude task, in 10% of trials an odour cue was 
delivered that indicated that no reward would be delivered on that 
trial. In the remaining 90% of trials, one of the following reward mag- 
nitudes was delivered, at random: 0.1, 0.3, 1.2,2.5,5,10 or 20 pl. In half 
of these trials, this reward was preceded by 1,500 ms by an odour cue 
(which indicated that a reward was forthcoming but did not disclose 
its magnitude). In the other half, it was unsignalled. 

In order to identify dopamine neurons while recording, neurons 
in the VTA were tagged with channelrhodopsin-2 (ChR2) by injecting 
adeno-associated virus (AAV) that expresses ChR2 ina Cre-dependent 
manner into the VTA of transgenic mice that express Cre recombinase 
under the promoter of the dopamine transporter (DAT) gene Slc6a3 
(B6.SJL-Slc6a3tm1.1(cre)Bkmn/J, The Jackson Laboratory)*°. Mice were 
implanted with a head plate and custom-built microdrive containing 
6-8 tetrodes (Sandvik) and optical fibre, as described”. 

All experiments were performed in accordance with the US National 
Institutes of Health Guide for the Care and Use of Laboratory Animals 
and approved by the Harvard Institutional Animal Care and Use Com- 
mittee. 


Neuronal data and analysis 

Extracellular recordings were made from VTA using a data acquisi- 
tion system (DigiLynx, Neuralynx). VTA recording sites were verified 
histologically. The identity of dopaminergic cells was confirmed by 
recording the electrophysiological responses of cells to a brief blue 
light pulse train, which stimulates only DAT-expressing cells. Spikes 
were sorted using SpikeSort3D (Neuralynx) or MClust-3.5 (A.D. Redish). 
Putative GABAergic neurons in the VTA were identified by clustering 
of firing patterns as described previously”. All confidence intervals 
are s.e.m. unless otherwise noted. 

Data analyses were performed using NumPy 1.15 and MATLAB R2018a 
(Mathworks). Spike times were collected in 1-ms bins to create peri- 
stimulus time histograms. These histograms were then smoothed by 
convolving with the function (1-e"‘) - e “/7 where Twas atime constant, 
set to 20 msas in ref. *°. For single-cell traces, we set Tto 200 ms for 
display purposes. 

After smoothing, the data were baseline-corrected by subtracting 
from each trial and each neuron independently the mean over that trial’s 
activity from —1,000 to O ms relative to stimulus onset (or relative to 
reward onset in the unexpected reward condition). 


Variable-probability task. n = 31 cells were recorded from five ani- 
mals, with the following number of cells per animal: 1, 4,16, land 9. 
Responses to cue for dopamine neurons were defined as the average 
activity from 0 to 400 ms after cue onset. This interval was chosen to 
match ref. °°. Responses to cue for putative GABAergic neurons were 
defined as the average activity from 0 to 1,500 ms after cue onset. This 
longer interval was chosen because these neurons had much slower 
responses, often ramping up slowly over the first 500 or 1,000 ms after 
cue onset” (Fig. 3d). 

We were interested in whether there was between-cell diversity 
in responses to the 50% cue. We first normalized the responses 
to the 50% cue ona per-cell basis as follows: c36'™= (C5) — Mean(C4))/ 
(Mean(Cogo) — Mean(cj9)) , where mean indicates the mean over trials 
within a cell. In order to be agnostic about the risk preferences of the 
animal, we then performed a two-tailed t-test of the cell’s normalized 
responses to the 50% cue against the average of all cells’ normalized 
responses to the 50% cue. This is the test for optimistic or pessimistic 
probability coding that we report in the main text. Note that these 
t-statistics would be t-distributed if the differences between cells were 
dueto chance. Wealso report ANOVA results where we evaluate the null 
hypothesis that all cells’ normalized 50% responses have the same mean. 

The same pattern of results held when instead comparing responses 
to the 50% cue against the midway point between responses to the 10% 
cue and responses to the 90% cue. 


The per-cell cue responses shown in Extended Data Fig. 7 were nor- 
malized to zero mean and unit variance, to allow direct comparison of 
cells with different response variability. Each cell appears in one of three 
panels based onthe outcome of two single-tailed Mann-Whitney tests 
evaluating the rank order for Ci) < C59 aNd C59 < Cog (See SUPplementary 
Information section 3.3 for further details). The left, centre and right 
panels correspond to outcomes (P = 0.05, P< 0.05), (P< 0.05, P< 0.05 
or P>0.05, P= 0.05) and (P< 0.05, P > 0.05), respectively. 


Variable-magnitude task. n = 40 cells were recorded from five ani- 
mals, with the following number of cells per animal: 3, 6, 9, 16 and 6. 
Responses to reward were defined as the average activity from 200 to 
600 ms after reward onset. This time interval was selected to match 
ref.*° as closely as possible, while excluding the initial response to the 
feeder click*°*”’, which was not selective to reward magnitude and was 
positive for all reward magnitudes. This enabled us to find the reward 
magnitudes for which the dopamine response was either boosted or 
suppressed relative to baseline. 

The reversal point (that is, the reward magnitude that would elicit 
neither a positive nor a negative deflection in firing relative to baseline) 
for each cell was defined as the magnitude M, that maximized the num- 
ber of positive responses to rewards greater than M, plus the number 
of negative responses to rewards less than M,. To obtain statistics for 
reliability of cell-to-cell differences in reversal point, we partitioned 
the data into random halves and estimated the reversal point for each 
cell separately in each half. We repeated this procedure 1,000 times 
with different random partitions, and we report the mean R value and 
geometric mean p value across these 1,000 folds. 

After measuring reversal points, we fit linear functions separately to 
the positive and negative domains of each cell. To obtain confidence 
intervals, we divided the data into seven random partitions (seven being 
the smallest number of trials in any condition for any cell), subject to 
the constraint that every condition for every cell contain at least one 
trial in each partition. In each partition, we repeated the procedure 
for estimating reversal points and finding slopes in the positive and 
negative domains. Our confidence interval on T= a"‘/(a’* +a) was then 
thes.e.m. of the values calculated across the seven partitions. ANOVAS 
are also reported testing the null hypothesis that means (across parti- 
tions) were not different between cells. 

Fitting linear functions to dopamine responses was more logical in 
utility space than in reward volume space. We relied on ref. ** to approxi- 
mate the underlying utility function from the dopamine responses 
to rewards of varying magnitudes. We used these empirical utilities 
instead of raw reward magnitudes for the analyses shown in Fig. 4. 
However, none of the reported results were sensitive to this choice of 
utility function. We also ran the analyses using other utility functions, 
and these results are reported in Extended Data Fig. 5. One cell was 
excluded from analyses in Fig. 5: because it had no positive responses 
to any reward magnitude, aslope could not be fit in the positive domain. 

When measuring the correlation (across cells) between reversal 
point andr, we first randomly split the data into two disjoint halves of 
trials. In one half, we first calculated reversal points RP, and used these 
reversal points to calculate a and a. In the other half, we calculated 
reversal points RP,. The correlation we report is between RP, and t=a"/ 
(a* +a’). We did this to avoid confounds associated with using the same 
data to estimate both slopes and intercepts. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 
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https://doi.org/10.17605/OSF.IO/UX5RG. 
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Extended Data Fig. 1| Mechanism of distributional TD. a, The degree of 
asymmetry in positive to negative scale determines the equilibrium where 
positive and negative errors balance. Equal scaling equilibrates at the mean, 
whereas a larger positive (negative) scaling produces an equilibrium above 
(below) the mean. b, Distributional prediction emerges through experience. 


Quantile (sign function) version is displayed here for clarity. Model is trained 
onarbitrary task with trimodal reward distribution. c, Same as b, viewed in 
terms of cumulative distribution (left) or learned value for each predictor 
(quantile function) (right). 
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Extended Data Fig. 2| Learning the distribution of returns improves 
performance of deep RL agents across multiple domains. a, DQN and 
distributional TD share identical nonlinear network structures. b,c, After 
training classical or distributional DON on MsPacman, we freeze the agent and 
then train a separate linear decoder to reconstruct frames from the agent’s 
final layer representation. For each agent, reconstructions are shown. The 
distributional model’s representation allows substantially better 
reconstruction. d, Ata single frame of MsPacman (not shown), the agent’s value 
predictions together representa probability distribution over future rewards. 
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Reward predictions of individual RPE channels shownas tick marks ranging 
from pessimistic (blue) to optimistic (red), and kernel density estimate shown 
in black. e, Atari-57 experiments with single runs of prioritized experience 
replay*° and double DQN“ agents for reference. Benefits of distributional 
learning exceed other popular innovations. f, g, The performance pay-off of 
distributional RL can be seen across a wide diversity of tasks. Here we give 
another example, a humanoid motor-control task in the MuJoCo physics 
simulator. Prioritized experience replay agent is shown for reference”. Traces 
show individual runs; averages are in bold. 
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Extended Data Fig. 3 | Simulation experiment to examine the role of 
representation learning in distributional RL. a, Illustration of tasks Land 2. 
b, Example images for each class used in our experiment” c, Experimental 
results, where each of ten random seeds yields an individual run shown with 
traces; average over seeds is shown in bold. d, Sameasc, but for control 
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experiment. e, Bird-dog t-SNE visualization of final hidden layer of network, 
given different input images (blue, bird; red, dog). Left, classical TD; right, 
distributional TD; top row, representation after training on task 1; bottom row, 
representation after training on task 2. 
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Model 6: Synaptic asymmetries 
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Extended Data Fig. 4 | See next page for caption. 


Extended Data Fig. 4 | Null models. a, Classical TD plus noise does not give rise 
to the pattern of results observed in real dopamine data in the variable- 
magnitude task. When reversal points were estimated in two independent 
partitions there was no correlation between the two (P= 0.32 by linear 
regression). b, We then estimated asymmetric scaling of responses and found 
no correlation between this and reversal point (P= 0.78 by linear regression). 

c, Model comparison between ‘same’, a single reversal point, and ‘diverse’, 
separate reversal points. In both, the model is used to predict whether a held- 
out trial has a positive or negative response. d, Simulated baseline-subtracted 
RPEs, colour-coded according to the ground-truth value of bias added to that 
cell’s RPEs. e, Across all simulated cells, there was a strong positive relationship 
between pre-stimulus baseline firing and the estimated reversal point. f, Two 
independent measurements of the reversal point were strongly correlated. 

g, The proportion of simulated cells that have significantly positive (blue) or 
negative (red) responses showed no magnitudes with both positive and 
negative responses. h, In the simulation, there was a significant negative 
relationship between the estimated asymmetry of each cell and its estimated 


reversal point (opposite that observed in neural data). i, Diagram illustrating a 
Gaussian-weighted topological mapping between RPEs and value predictors. 
j, Varying the standard deviation of this Gaussian modulates the degree of 
coupling. k, Inatask with equal chance of areward 1.0 or 0.0, distributional TD 
with different levels of coupling shows robustness to the degree of coupling. 
I, When there is no coupling, a distributional code is not learned, but 
asymmetric scaling can cause spurious detection of diverse reversal points. 
m, Even though every cell has the same reward prediction they appear to have 
different reversal points. n, With this model, some cells may have significantly 
positive responses, and others significantly negative responses, in response to 
the same reward. o, But this model is unable to explain a positive correlation 
between asymmetric scaling and reversal points. p, Simulation of ‘synaptic’ 
distributional RL, in which learning rates but not firing rates are 
asymmetrically scaled. This model predicts diversity in reversal points 
between dopamine neurons. q, The model predicts no correlation between 
asymmetric scaling of firing rates and reversal point. 
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Extended Data Fig. 5 | Asymmetry and reversal. a, Left, all data points (trials) 
from an example cell. The solid lines are linear fits to the positive and negative 
domains, and the shaded areas show 95% confidence intervals calculated with 
Bayesian regression. Right, the same cell plotted in the format of Fig. 4b. 

b, Cross-validated model comparison on the dopamine data favours allowing 
each cell to have its own asymmetric scaling (P=1.4 x 10 ™ by paired t-test). The 
standard error of the mean appears large relative to the Pvalue because the 
Pvalueis computed using a paired test. c, Although the difference between 
single-asymmetry and diverse-asymmetry models was small in firing-rate 
space, such small differences correspond to large differences in decoded 
distribution space (more details in Supplementary Information). Each pointisa 
TD simulation; colour indicates the degree of diversity in asymmetric scaling 
within that simulation. d, We were interested in whether an apparent 
correlation between reversal point and asymmetry could arise as an artefact, 
owing toa mismatch between the shape of the actual dopamine response 
function and the function used to fit it. Here we simulate the variable- 
magnitude task using a TD model without a true correlation between 
asymmetric scaling and reversal point. We then apply the same analysis 
pipeline as in the main paper, to measure the correlation (colour axis) between 


b 23 Cc 7 T 
8 } ' ° 
! 
22.5 Es ' 
5 25 : . 
5 22 KS F i 
2 2 ot ‘ 
§ 21.5 = Mae 
z <3 
c & ! 
3 7! 82 
F é 
20.5 5 *! 
20 @ OF-=3-- 352+ 
e081 ek = a Fa 
o] sY 
5nd? “yee ° -100 -50 te) 50 100 150 
model % difference in distribution space 
f 20 pL e 
e 
10 pl ° _ e 
« A 
° 
ote * Bop P s 
eo % 2 bs ry . 
fa eas ee ae ea g sr. ft 
eo © oe aad ee of Soo 
2.5 pL ‘ee 
0.1 pl 2 a 
0 0.2 0.4 0.6 0.8 ] 0.4 0.6 0.8 1 
a'/(a*+a°) a*/(a*t+a) 


asymmetric scaling and reversal point. We repeat this procedure 20 times with 
different dopamine response functions in the simulation, and different 
functions used to fit the positive and negative domains of the simulated data. 
The functions are sorted in increasing order of concavity. An artefact can 
emerge if the response function used to fit the datais less concave than the 
response function used to generate the data. For example, when generating 
data with a Hill function but fitting with a linear function, a positive correlation 
can be spuriously measured. e, When simulating data from the distributional 
TD model, where a true correlation exists between asymmetric scaling and 
reversal point, it is always possible to detect this positive correlation, evenif 
the fitting response function is more concave than the generating response 
function. The black rectangle highlights the function used to fit real neural 
datainc. f, Here we analyse the real dopamine cell dataidentically to Fig. 4d, 
but using Hill functions instead of linear functions to fit the positive and 
negative domains. Because the correlation between asymmetric scaling and 
reversal point still appears under these adversarial conditions, wecan be 
confident itis not driven by this artefact. g, Same as Fig. 4d, but using linear 
response function and linear utility function (instead of empirical utility). 
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Extended Data Fig. 6 | Cue responses versus outcome responses, and more 
evidence for diversity. a, In the variable-probability task: firing at cue, versus 
firing at reward (left) or omission (right). Colour brightness denotes 
asymmetry. b, Sameasa, but showing RPEs from distributional TD simulation. 
c, Data from ref.*° also included unpredicted rewards and unpredicted 
airpuffs. Top two panels show responses for all the cells recorded in one animal 
and bottom two panels show responses for all the cells of another animal. Left, 
thex axis is the baseline-subtracted response to free reward and the yaxis is the 
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baseline-subtracted response to airpuff. Dots with black outlines are per-cell 
means, and un-outlined dots are means of disjoint subsets of trials indicating 
consistency of asymmetry. Right, the same data plotted ina different way, with 
cells sorted along the x axis by response to airpuff. Response to reward is shown 
in greyscale dots. Asterisks indicate significant difference in firing rates from 
one or bothneighbouring cells. d, Simulations for distributional but not 
classical TD produce diversity in relative response. 
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Extended Data Fig. 7 | More details of data in variable-probability task. 

a, Details of analysis method. Of the four possible outcomes of the two Mann- 
Whitney tests (Methods), two outcomes correspond to interpolation (middle) 
and oneeach tothe pessimistic (left) and optimistic (right) groups. 

b, Simulation results for the classical TD and distributional TD models. yaxis 
shows the average firing-rate change, normalized to mean zero and unit 
variance, in response to each of the three cues. Each curve is one cell. The cells 
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are split into panels according toa statistical test for type of probability coding 
(see Methods for details). Colour indicates the degree of optimism or 
pessimism. Distributional TD predicts simultaneous optimistic and 
pessimistic coding of probability, whereas classical TD predicts all cells have 
the same coding. c, Sameasb, but using data from real dopamine neurons. The 
pattern of results closely matches the predictions from the distributional TD 
model. d, Sameas b, using data from putative VTA GABAergic interneurons. 
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Extended Data Fig. 8 | Further distribution decoding analysis. This figure 
pertains to the variable-magnitude experiment. a—c, In the decoding shownin 
the main text, we constrained the support of the distribution to the range of the 
rewards in the task. Here, we applied the decoding analysis without 
constraining the output values. We find similar results, although with 
increased variance. d, We compare the quality of the decoded distribution 
against several controls. The real decoding is shown as black dots. In coloured 
lines are reference distributions (uniform and Gaussian with the same mean 
and variance as the ground truth; andthe ground truth mirrored). Black traces 
shift or scale the ground-truth distribution by varying amounts. e, Nonlinear 
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functions used to shift asymmetries, to measure degradation of decoded 
distribution. The normal cumulative distribution function @ is used to 
transform asymmetry T. This is shifted by some values and transformed back 
through the normal quantile function @*. Positive values s increase the value of 
tand negative values decrease the value of tr. f, Decoded distributions under 
different shifts, s.g, Plot of shifted asymmetries for values of s used. 

h, Quantification of match between decoded and ground-truth distribution, 
for eachs.i,j, Same as Fig. 5d, e, but for putative GABAergic cells rather than 


dopamine cells. 
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Extended Data Fig. 9 | Simultaneous diversity. a, b, Variable-probability task. 
Mean spiking (a) and licking (b) activity in response to each of the three cues 
(indicating 10%, 50% or 90% probability of reward) at time 0, andin response to 
the outcome (reward or no reward) at time 2,000 ms.c, Trial-to-trial variations 
in lick rates were strongly correlated with trial-to-trial variations in dopamine 
firing rates. Mean of each cell is subtracted from each axis, and the x axis is 
binned for ease of visualization. d, Dopaminergic coding of the 50% cue relative 
tothe 10% and 90% cues (as shown inb) was not correlated with the same 
measure computed onlick rates. Therefore, between-session differences in 
cue preference, measured by anticipatory licking, cannot explain between-cell 
differences in optimism. e, Four simultaneously recorded dopamine neurons. 


These are the same four cells whose time courses are shown in Fig. 3c. 

f, Variable-magnitude task. Across cells, there was no relationship between 
asymmetric scaling of positive versus negative prediction errors, and baseline 
firing rates (R= 0.18, P=0.29). Each point isacell. These data are from 
dopamine neurons at reward delivery time. g, t-statistics of response to5 pl 
reward compared with baseline firing rate, for all 16 cells from animal D. Some 
cells respond significantly above baseline and others significantly below. Cells 
are sorted by t-statistic. h, Spike rasters showing all trials in which the 5 pl 
reward was delivered. The two panels are two example cells from the same 
animal with rasters shown in Fig. 2. 
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Extended Data Fig. 10 | Relationship of results to original analysis. Here 
we reproduce results for the variable-magnitude task in ref. °° with two 
different time windows. a, Change in firing rate in response to cued reward 
delivery averaged over all cells. b, Comparing Hill-function fit and response 
averaged over all cells for expected (cued) and unexpected reward delivery. 
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c, Correlation between response predicted by scaled common response 
function and actual response to expected reward delivery. d, Zoomingin 
oncshows correlation driven primarily by larger reward magnitudes. 
e-h, Repeating the above analysis for a window of 200-600 ms. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


a The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


— For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


[| For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Simulation experiments were built with custom code and use the following components: Python 2.7 and Tensorflow 1.13.1. Artificial 
agent experiments are based upon previously published methods. 


Data analysis Data analysis was performed using MATLAB R2018a, NumPy 1.15, and SciPy 1.2.1. Analysis code from our value-distribution decoding 


analyses, as well as code used to generate model predictions for distributional TD, are available at https://doi.org/10.17605/OSF.1O/ 
UX5RG. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


The neuronal data analyzed in this work have been uploaded to OSF and are available at https://doi.org/10.17605/OSF.IO/UX5RG. 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size All data were taken from a previously-published set of experiments (Eshel et al 2015, 2016). 
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Data exclusions No data were excluded. 
Replication Cross-validation was used to avoid over-fitting. 


Randomization There were no between-animal manipulations. Trial sequences were randomized. 


Blinding There were no design factors to which blinding would be relevant. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
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Eukaryotic cell lines Flow cytometry 
| | Palaeontology |] MRI-based neuroimaging 
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Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals The rodent data we re-analyzed here were first reported in Eshel et al. (2015). We used 5 adult male mice, backcrossed for more 
than 5 generations with C57/BL6J mice, that were heterozygous for Cre recombinase under the control of either the DAT gene or 
the Vgat gene. 

Wild animals The study did not involve wild animals. 

Field-collected samples The study did not involve field-collected samples. 

Ethics oversight All experiments were performed in accordance with the US National Institutes of Health Guide for the Care and Use of 


Laboratory Animals and approved by the Harvard Institutional Animal Care and Use Committee. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Empirical and anecdotal evidence has associated stress with accelerated hair greying 
(formation of unpigmented hairs)'”, but so far there has been little scientific 
validation of this link. Here we report that, in mice, acute stress leads to hair greying 
through the fast depletion of melanocyte stem cells. Using a combination of 
adrenalectomy, denervation, chemogenetics™, cell ablation and knockout of the 
adrenergic receptor specifically in melanocyte stem cells, we find that the stress- 
induced loss of melanocyte stem cells is independent of immune attack or adrenal 
stress hormones. Instead, hair greying results from activation of the sympathetic 
nerves that innervate the melanocyte stem-cell niche. Under conditions of stress, the 
activation of these sympathetic nerves leads to burst release of the neurotransmitter 
noradrenaline (also knownas norepinephrine). This causes quiescent melanocyte stem 
cells to proliferate rapidly, and is followed by their differentiation, migration and 
permanent depletion from the niche. Transient suppression of the proliferation of 
melanocyte stem cells prevents stress-induced hair greying. Our study demonstrates 
that neuronal activity that is induced by acute stress can drive a rapid and permanent loss 
of somatic stem cells, and illustrates an example in which the maintenance of somatic 
stem cells is directly influenced by the overall physiological state of the organism. 


Stress has been anecdotally associated with a variety of changes in 
tissues, including hair greying. However, whether external stressors 
are the causal factors, and whether stress-related changes occur at 
the level of somatic stem cells, remain poorly understood. The hair 
follicle cycles between growth (anagen), degeneration (catagen) and 
rest (telogen)°. The bulge and hair germ region of the follicle contains 
two populations of stem cells: hair follicle stem cells (HFSCs), which 
are epithelial tissues, and melanocyte stem cells (MeSCs)°, which are 
derived from the neural crest. HFSCs and MeSCs are normally quiescent 
except during early anagen, when they are activated concurrently to 
regenerate a pigmented hair’*®. Activation of HFSCs produces anew hair 
follicle. Activation of MeSCs generates differentiated melanocytes that 
migrate downwards, whereas MeSCs remain close to the bulge. At the 
hair bulb, differentiated melanocytes synthesize melanin to colour the 
newly regenerated hair fromthe root. At catagen, mature melanocytes 
are destroyed, leaving only the MeSCs that will initiate new rounds of 
melanogenesis in future cycles*”° (Extended Data Fig. 1a). The pre- 
dictable behaviour of MeSCs and melanocytes, and the visible nature 
of hair colour, makes the melanocyte lineage an accessible model to 
investigate how stress influences tissue regeneration. 


Diverse stressors induce hair greying 


To examine whether psychological or physical stressors promote 
hair greying, we used three approaches to model stress in C57BL/6) 
mice with black coat colour: restraint stress”, chronic unpredictable 
stress" and nociception-induced stress (which was achieved through 
an injection of resiniferatoxin (RTX), an analogue of capsaicin’®”’). All 
three procedures led to increased numbers of unpigmented white 
hairs over time. Restraint stress and chronic unpredictable stress led 
to noticeable hair greying after three to five rounds of hair cycles. 
Nociception-induced stress produced the most pronounced and rapid 
effect—many new hairs that formed in the next hair cycle after RTX 
injection became unpigmented (Fig. 1a, b, Extended Data Fig. 1b, c). 
Psychological or physical stressors trigger the adrenal glands to 
release stress hormones and catecholamines into the bloodstream”. 
In accordance with this, we detected anincrease in both corticosterone 
(the primary glucocorticoid stress hormone in rodents that is equivalent 
tocortisol in humans) and noradrenaline (acatecholamine) inthe blood 
of mice that were subjected to different stressors (Fig. 1c, Extended Data 
Fig. 1d), suggesting that our approaches induced classic stress responses. 
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Fig.1| Stress depletes MeSCs. a, Black-coated C57BL/6J mice were subjected to 
different stress models. CUS, chronic unpredictable stress. b, Hair greying 
after injection of mice with RTX or saline. Right, quantification of the area of 
skin covered by white hairs (n =10 mice for each condition, two-tailed unpaired 
t-test). c, Liquid chromatography with tandem mass spectrometry (LC-MS/MS) 
was used to quantify the concentration of serum stress hormones 
(corticosterone and noradrenaline) after injection of RTX alone or in 
combination with buprenorphine (bup) (n= 6 mice for each condition, one-way 
analysis of variance (ANOVA) with Tukey’s multiple comparisons test). 

d, Injection of RTX with buprenorphine blocks formation of white hairs (n=6 
mice for each condition, two-tailed unpaired t-test). e, Top, experimental 


RTX induces nociception by activating nociceptive sensory neu- 
rons’’, Blocking the ability of mice to sense pain with buprenorphine 
(an opioid analgesic) prevented the increase of corticosterone and 
noradrenaline after RTX injection, suggesting that blocking pain sensa- 
tion alleviates the physiological stress responses that are induced by 
RTX (Fig. 1c). Moreover, buprenorphine also suppressed the formation 
of white hairs in mice that were injected with RTX (Fig. 1d). These data 
showthat regardless of the stress modality, premature hair greying can 
occur under stress. Because the effect of nociception induction on hair 
greying was the strongest and fastest of all the stressors we tested, we 
focused on RTX injection as our primary stressor. 


Stress leads to loss of MeSCs 


Loss of hair pigmentation can be due to defects in the synthesis of 
melanin”, loss of differentiated melanocytes” or problems in the 
maintenance of MeSCs”. To understand how stress affects the mel- 
anocyte lineage, we injected RTX into mice during anagen, a stage in 
which both MeSCs and differentiated melanocytes were present but 
located within distinct compartments—MeSCs were near to the bulge, 
whereas differentiated melanocytes were at the hair bulb (Fig. le). After 
injection of RTX, TRP2* MeSCs were significantly reduced in number 
across the entire skin (Fig. le). In many hair follicles, MeSCs were com- 
pletely lost from the bulge within five days, whereas differentiated 
melanocytes in the same hair follicle remained unperturbed (Fig. le, 
Extended Data Fig. le). These differentiated melanocytes continued 
to generate pigments, and the hair coat remained black five days after 
RTX injection (Extended Data Fig. 1f, g). When hair follicles in the RTX- 
injected mice entered catagen and telogen, many had lost all MeSCs 
(Fig. le). Subsequently, when the next round of anagen started, differ- 
entiated melanocytes were not produced to colour new hair shafts, and 
unpigmented hairs emerged (Fig. le, Extended Data Fig. 1h). Although 
some regenerated hairs remained pigmented, the numbers of MeSCs 
in these pigmented hairs were also reduced compared to those in mice 
not treated with RTX (Extended Data Fig. li). RTX injection led to the 
same extent of hair greying in both male and female mice (Extended 
Data Fig. 1j). Moreover, RTX also caused the loss of MeSCs when injected 
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during telogen. In this case, unpigmented hairs appeared as soon as 
new hairs emerged in the following anagen (Extended Data Fig. 2a, 
b). These results suggest that MeSCs are exquisitely sensitive to RTX- 
induced stress, whereas differentiated melanocytes and the synthesis 
of melanin are not directly affected. MeSCs were also lost or reduced 
in mice that were subjected to restraint stress or chronic unpredict- 
able stress (Extended Data Fig. 2c). Because stress depleted MeSCs, 
the loss of hair pigmentation in all three conditions was permanent 
(Extended Data Fig. 2d). Collectively, these data indicate that stress 
leads to loss of MeSCs. 


Noradrenaline drives loss of MeSCs 


Next, we asked how stress transmits to the periphery to alter MeSCs 
(Fig. 2a). Immune attack has been postulated to cause stress-induced hair 
greying””*. To test the involvement of the immune system, we injected 
RTX into Ragi mutant mice, which lack both T and B cells, and into CD1Ib- 
DTR mice, in which myeloid lineages had been ablated by diphtheria 
toxin. Injection of RTX into these immune-deficient mice still resulted 
information of white hairs, suggesting that RTX-induced hair greying is 
independent of T cells, Bcells or myeloid cells (Extended Data Fig. 3a, b). 

Asall stressors led to an increase in the levels of corticosterone and 
noradrenaline in the blood, we asked if these stress-induced circulating 
factors hada role inthe stress-induced loss of MeSCs. RNA sequencing 
(RNA-seq) data from MeSCs that were purified by fluorescence-acti- 
vated cell sorting (FACS) suggested that MeSCs express the glucocorti- 
coid receptor (GR, also knownas Nr3cl, areceptor for corticosterone) 
and the B,-adrenergic receptor (Adrb2, a receptor for noradrenaline) 
(Extended Data Fig. 3c, Methods). To determine whether the glucocorti- 
coid receptor mediated the effects of stress on MeSCs, we depleted this 
protein in MeSCs using Tyr-CreERT2—a tamoxifen-inducible CreERT2 
fusion protein directed by a mouse tyrosinase promoter®”*”°, RTX 
injection into Tyr’"?-GR™ mice still resulted in hair greying (Extended 
Data Fig. 3d). Moreover, no changes in MeSCs or hair pigmentation were 
observed when the levels of corticosterone were increased by feeding 
(Extended Data Fig. 3e). These data suggest that corticosterone is not 
a major driver of stress-induced loss of MeSCs. 
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Fig. 2 | Noradrenaline drives hair greying. a, Possible mechanisms of loss of 
MeSCs.b, Injection of RTX into Tyr'"**!?; Adrb2/" (MeSC-specific Adrb2 
conditional knockout (cKO)) mice does not trigger hair greying (n= 6 mice for 
each condition, two-tailed unpaired t-test). NA, noradrenaline. c, Formation of 
white hairs at sites of noradrenaline injection (n=10 injected sites from 8 mice 
for each condition). For quantifications, see Extended Data Fig. 4e. Yellow 
dashed circles denote intradermal injection sites. d, Formation of white hairs 
after RTX injection in adrenalectomized mice (ADX) (n=6 mice for each 
condition, two-tailed unpaired t-test). Alldata are mean +s.d. 


We then investigated whether ADRB2 might mediate the effect of 
stress on MeSCs. After injection of RTX, we observed a marked induc- 
tion of phosphorylated CREB (a downstream effector of ADRB2) in 
MeSCs, but not in mature melanocytes (Extended Data Fig. 4a). Moreo- 
ver, when we depleted ADRB2 from MeSCs using Tyr—CreERT2, white 
hairs did not form after RTX injection (Fig. 2b). These data suggest that 
ADRB2 expressed by MeSCs is essential for stress-induced hair greying. 
By contrast, when ADRB2 was depleted from hair follicle stem cells that 
share the same niche with MeSCs, RTX injection still resulted in hair 
greying (Extended Data Fig. 4b). Inthe absence of stress, depletion of 
ADRB2in MeSCs did not lead to changes in MeSCs, melanocytes or pig- 
ment production, suggesting that the noradrenaline-ADRB2 pathway is 
dispensable for melanogenesis during the normal hair cycle (Extended 
Data Fig. 4c, d). Collectively, these data suggest that noradrenaline sig- 
nals through ADRB2 on MeSCs to mediate stress-induced hair greying. 

To test whether an increase in noradrenaline was sufficient to cause 
hair greying in the absence of stress, we introduced noradrenaline 
locally to the skin through intradermal injections. Local injection of 
noradrenaline promoted hair greying at the injection sites in wild- 
type mice and mice with HFSC-specific knockout of Adrb2, but did 
not cause hair greying in mice with MeSC-specific knockout of Adrb2 
(Fig. 2c, Extended Data Fig. 4e-g). Together, our data demonstrate that 
although immune cells and corticosterone are dispensable, noradrena- 
line signalling appears to be necessary for stress-induced hair greying 
and sufficient to trigger hair greying in the absence of stress. 


Finding the source of noradrenaline 


As the adrenal gland is a major source of noradrenaline under stress, 
to determine whether noradrenaline derived from the adrenal glands 
mediates stress-induced hair greying, we surgically removed both 
adrenal glands. Adrenalectomy significantly reduced the levels of corti- 
costeroneand noradrenaline in the bloodstream of RTX-injected mice 
(Extended Data Fig. 5a). However, injection of RTX into adrenalecto- 
mized mice still caused hair greying—suggesting that RTX-induced 
hair greying is independent of hormones or catecholamines from the 
adrenal glands (Fig. 2d). 
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An alternative source of noradrenaline is the sympathetic nerv- 
ous system. Under stress, the sympathetic nervous system becomes 
activated to induce fight-or-flight responses through the secretion of 
noradrenaline from peripheral axon terminals”. In the skin, sympa- 
thetic nerves terminate close to the bulge where MeSCs reside (Fig. 3a). 
Furthermore, skin regions with high numbers of unpigmented hairs 
also have denser sympathetic innervation (Extended Data Fig. 5b). 

To determine whether sympathetic nerves are indeed activated 
after RTX injection, we examined the levels of FOS, an immediate early 
transcription factor that serves as a reporter of neuronal activity”. 
Robust induction of FOS was detected in the cell bodies of sympathetic 
neurons within 1 hour of RTX injection, peaking at around 2-4 hours 
and diminishing after 24 hours, which suggests that RTX injection led 
to a burst activation of sympathetic neurons (Fig. 3b, Extended Data 
Fig. 5c). Moreover, when buprenorphine was injected together with 
RTX to block pain, FOS expression was not induced in sympathetic 
neurons (Fig. 3b). These data suggest that the sympathetic nervous 
system becomes highly activated following nociception-induced stress. 

To test whether activation of sympathetic nerves is responsible for 
the loss of MeSCs and hair greying under stress, we used 6-hydroxy 
dopamine (6-OHDA)—a selective neurotoxin for sympathetic nerves*®— 
to ablate sympathetic nerves. Sympathectomy blocked RTX-induced 
hair greying and loss of MeSCs (Fig. 3c, Extended Data Fig. 5d), sug- 
gesting that sympathetic nerves do mediate stress-induced hair grey- 
ing. In addition, guanethidine—a chemical that blocks the release of 
noradrenaline from sympathetic nerve terminals”’—suppressed hair 
greying and loss of MeSCs after RTX injection (Extended Data Fig. 5e). 
Collectively, these data suggest that noradrenaline secreted from sym- 
pathetic nerve terminals mediates the effect of stress on MeSCs. 

To determine whether the activation of sympathetic nerves in the 
absence of stress is sufficient to drive the loss of MeSCs, we took a 
chemogenetic approach using the DREADDs (designer receptors 
exclusively activated by designer drugs) system**. G,-DREADD is an 
artificial G,-protein-coupled receptor that is activated by the inert 
molecule clozapine N-oxide (CNO), but not by endogenous ligands. 
Activation of G,DREADD leads to intracellular release of calcium and 
neuronal firing. We generated TH“*”?;CAG-LSL-G,-DREADD;Rosa”™"“ 
mice in which the sympathetic nerves can be activated artificially with 
CNO (Fig. 3d, Supplementary Discussion). Injection of CNO induced 
the activation of FOS insympathetic ganglia, confirming the efficacy of 
this strategy (Extended Data Fig. 5f). Activation of sympathetic nerves 
with the DREADD system led to loss of MeSCs and hair greying at the 
sites where CNO was injected (Fig. 3d, Extended Data Fig. 5g). Moreover, 
when TH-CreERT2 was activated mosaically by a low dose of tamox- 
ifen, intradermal CNO injection resulted in loss of MeSCs only in hair 
follicles innervated by DREADD-positive nerve fibres (recognizable 
by their expression of GFP in the membrane; Fig. 3e, Extended Data 
Fig. 5h). These data suggest that activation of sympathetic nerves inthe 
absence of stressors is sufficient to drive the loss of MeSCs. Together, 
our findings suggest that increased noradrenaline secreted from the 
sympathetic nerve terminals drives the depletion of MeSCs under 
stress. 


Stress drives hyperproliferation of MeSCs 


Next, we aimed to identify the early changes in MeSCs under stress that 
might account for their loss (Fig. 4a). Immunofluorescence failed to 
detect active caspase-3 or TUNEL (terminal deoxynucleotidyl trans- 
ferase dUTP nick end labelling) signals in MeSCs before their depletion 
from the niche after injection of RTX or noradrenaline. Moreover, RTX 
injection into Ripk3 mutant mice, which lack a key kinase for necrosis, 
still caused hair greying (Extended Data Fig. 6a—c). These data sug- 
gest that stress-induced loss of MeSCs is not mediated by apoptosis or 
necrosis. Radiation causes DNA damage in MeSCs, and leads to their 
differentiation within the niche”. However, we failed to detect y-H2AX 
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Fig. 3 | Hyperactivation of the sympathetic nervous system depletes MeSCs. 
a, Sympathetic nerves innervate MeSC niches. White arrowhead onthe 
immunofluorescent staining indicates the close proximity of nerve endings to 
MeSCs (n= 6 mice for each condition). b, Left, immunofluorescent staining of 
sympathetic ganglia for tyrosine hydroxylase (TH; green) and FOS (red) from 
mice injected with saline, RTX and RTX with buprenorphine. Right, 
quantification of FOS* cells in sympathetic ganglia (SG) (n=6 ganglia from3 
mice for each condition, one-way ANOVA with Tukey’s multiple comparisons 
test).c, Injection of 6-hydroxydopamine (6-OHDA) blocks loss of MeSCs and 
induction of white hairs by RTX (n= 30 hair follicles from 6 mice for each 
condition, two-tailed unpaired t-test). SN abla, sympathetic nerve ablation. See 
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also Extended Data Fig. Sd. d, Left, schematic of sympathetic nerve activation 
using aG,-DREADD system. Right, immunofluorescent staining for TH (green) 
and TRP2 (red) from TH““"”?;CAG-LSL-G,-DREADD mice treated with saline or 
CNO (n= 30 hair follicles from 6 mice for each condition, two-tailed unpaired 
t-test). e, Mosaic activation of sympathetic nerves in TH""”?;CAG-LSL-G,- 
DREADD;Rosa™""” mice. Bar graphs quantify the number of MeSCs in hair 
follicles innervated by DREADD-negative sympathetic nerves (without 
DREADD) versus DREADD-positive sympathetic nerves (with DREADD; marked 
by membrane GFP expression) (n= 30 hair follicles for each condition from4 
mice, two-tailed unpaired f-test). Scale bars, 50 pm. All dataare mean +s.d. 


foci (a hallmark of DNA damage) in MeSCs after injection of RTX or 
noradrenaline, which suggests that stress-induced depletion of MeSCs 
is not mediated through DNA damage (Extended Data Fig. 6d). 

Quiescence isa key feature of many somatic stem cells®* *, and loss of 
quiescence has been postulated to cause loss of MeSCs in Bc/2 mutant 
mice’°**, To examine whether stress alters quiescence in MeSCs, we 
injected RTX or noradrenaline into mice that had entered full anagen 
(when MeSCs are normally quiescent). We saw a marked increase in 
the number of proliferating MeSCs within 24 hours of the injection of 
RTX or noradrenaline—about 50% of MeSCs became positive for phos- 
phorylated histone H3, a marker of M phase (Fig. 4b). This number isin 
sharp contrast to the number of proliferating MeSCs in early anagen 
(about 6%), the only stage when MeSCs proliferate to self-renew?*> 
(Extended Data Fig. 6e). Conversely, no changes in proliferation or 
apoptosis were observed in mature melanocytes after injection of RTX 
or noradrenaline (Extended Data Fig. 6f, g). These data suggest that 
increased noradrenaline forces MeSCs to enter a rapid and abnormally 
proliferative state, but does not affect mature melanocytes. 

To monitor changes in MeSCs after stress, we generated 
Tyr"?8"2-Rosa™’™© mice, in which MeSCs can be traced by their mem- 
brane expression of GFP (Fig. 4c). Consistent with the observation that 
proliferation is an early response of MeSCs to stress, we saw a transient 
increase in GFP’ cells shortly after RTX injection (Fig. 4c, day 1; quanti- 
fied by FACS in Extended Data Fig. 6h). After this initial phase, many 
GFP* cells began to exhibit striking dendritic branching, which is a 
characteristic feature of differentiated MeSCs (Fig. 4c, day 2). They also 
began to move away from the bulge—some migrated downwards along 
the hair follicle, and some migrated out into the dermis or epidermis 
(Fig. 4c, days 2 and 3). By day 3, many GFP’ cells had migrated out of 
the bulge, and by day 4, many hair follicles had lost all GFP* cells inthe 
bulge. Moreover, ectopic pigmentation could be detected along the 
hair follicle, epidermis and dermis—areas that are normally devoid of 
pigments (Fig. 4d, Extended Data Fig. 6i). These data suggest that after 
stress, MeSCs undergo rapid proliferation followed by differentia- 
tion and migration, which leads to their loss from the niche (Fig. 4e, 
Supplementary Discussion). 
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Fig. 5 | Inhibition of aberrant MeSC proliferation prevents stress-induced 
hair greying. a, Experimental workflow. FACS was performed in cells at 
telogen. b, GO enrichment analysis of significantly dysregulated genes in 
stressed MeSCs (n=2 biologically independent samples for each condition, 
Fisher’s exact test). c, Heat map of signature gene expression for genes that are 
related to the proliferation of MeSCs (n=2 biologically independent samples 
for each condition). d, qRT-PCR of genes that are related to MeSC proliferation 
and differentiation in cultured primary human melanocytes treated with 
noradrenaline (n=6 samples from 3 independent donors, two-way ANOVA with 
Benjamini-Hochberg correction). e, Top, immunofluorescent staining for 
TRP2 (red) from mice five days after treatment with RTX, RTX with CDK 
inhibitors (AT7519 or flavopiridol (flavo)) or RTX with MeSC-specific 
overexpression of P27 (P27 OE). Bottom, quantification of MeSCs (n=30 hair 
follicles from 6 mice for each condition, one-way ANOVA with Tukey’s multiple 
comparisons test). f, Topical treatment with AT7519, flavopiridol or MeSC- 
specific P27 overexpression inhibits RTX-induced hair greying (n= 6 mice for 
each condition, one-way ANOVA with Tukey’s multiple comparisons test). 

g, Model summarizing the main findings of the study. Under strong external 
stressors, activated sympathetic nerves secrete noradrenaline that binds to 
ADRB2 on MeSCs. Noradrenaline-ADRB2 signalling drives rapid proliferation 
of MeSCs, followed by their ectopic differentiation and exhaustion. Scale bars, 
50 um. All dataare mean+s.d. 


Transcriptome analyses of MeSCs 

To discover the molecular mechanisms that drive stress-induced loss of 
MeSCs, we conducted RNA-seq using FACS-purified MeSCs from con- 
trol and RTX-treated mice 12 hours after RTX injection, before MeSCs 
showed phenotypic differences (Fig. 5a, Extended Data Fig. 7a-c). 
Examination of the expression of marker genes for different types 
of skin cell confirmed that we had successfully enriched for MeSCs 
(Extended Data Fig. 7d). To uncover major molecular changes, we 
conducted Gene Ontology (GO) enrichment analysis (Fig. 5b). We 
also curated a list of known genes that are associated with the prolif- 
eration and differentiation of MeSCs (Fig. 5c, Extended Data Fig. 7e). 
Moreover, we used a list of genes that have previously been shown to 
be involved in entry into the cell cycle to assess whether regulators of 
the cell cycle are altered at the transcriptional level®* (Extended Data 
Fig. 7f). Some of these key changes were also verified by quantitative 
PCR with reverse transcription (qRT-PCR) (Extended Data Fig. 7g). 
Collectively, we identified changes in several cell-cycle regulators in 
stressed MeSCs, including cyclin-dependent kinase 2 (Cdk2), whichis 
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akey promoter of the G1-to-S transition. Genes encoding receptors for 
ligands that promote the proliferation, differentiation and migration 
of MeSCs, including Kit?’ and Mcir°*®, were also upregulated. In addi- 
tion, genes that are involved in melanogenesis”, including Mitf, Tyrp1, 
Tyr, Oca2 and Pmel, were upregulated (Fig. 5c, Extended Data Fig. 7e, 
g). These data suggest that MeSCs upregulate their proliferation and 
differentiation programs after stress. Furthermore, treatment with 
noradrenaline also led toa rapid induction of proliferation-associated 
genes such as Cdk2 and differentiation-associated genes such as Mitf 
and Tyrin cultured human melanocyte cells (Fig. 5d). These data suggest 
that noradrenaline elicits similar responses in melanocyte lineages of 
both humans and mice. 


Blocking proliferation preserves MeSCs 


Because MeSCs first lose quiescence when under stress, we asked 
whether transient suppression of proliferation early in the stress 
response might prevent their depletion. We injected RTX at full ana- 
gen and applied CDK inhibitors (AT7519 or flavopiridol) topically to 
suppress proliferation transiently until 48 hours after injection®”*°. 
MeSCs in RTX-injected mice that were treated with CDK inhibitors 
remained quiescent and were preserved in the niche (Fig. Se, Extended 
Data Fig. 8a). Proliferation of cells in the hair bulb remained largely 
normal, probably because the penetration of inhibitors into subcu- 
taneous regions in full anagen was limited (Extended Data Fig. 8b). To 
further confirm that loss of MeSCs can be prevented by inhibiting their 
proliferation, we generated a genetic mouse model, Tyr'“*?;Rosa’"™™; 
TetO’””, in which the CDK inhibitor P27 can be transiently induced with 
doxycycline in MeSCs. Induction of P27 expression in MeSCs suppressed 
their aberrant proliferation and preserved MeSCs in the niche under 
stress (Fig. Se, Extended Data Fig. 8a). These preserved MeSCs showed 
an undifferentiated morphology and retained their functionality, as 
newly regenerated hairs in subsequent cycles maintained pigmentation 
(Fig. 5f). Together, these data suggest that loss of quiescence drives the 
depletion of MeSCs under conditions of stress, and that suppression of 
the proliferation of MeSCs is sufficient to prevent their loss. 


Discussion 


Acute stress is known to cause transient and beneficial fight-or-flight 
responses that are essential for survival. Here, we demonstrate that 
acute stress can also cause non-reversible depletion of somatic stem 
cells through activation of the sympathetic nervous system—resulting 
in permanent damage to tissue regeneration (Fig. 5g). Our findings sup- 
port the emerging notion that the sympathetic nervous system not only 
regulates body physiology, but also influences a variety of processes 
in development and tissue maintenance”*' “, The adrenal glands are 
the central regulators of stress responses. However, we show that the 
adrenal-gland-derived circulating stress hormones and catecholamines 
do not drive changes in MeSCs under stress. As sympathetic nerves 
innervate essentially all organs, acute stress might have a broad and 
rapid effect on many tissues through neuronal signals. 

The reason for the existence of such an interaction between nerves 
and MeSCs is unknown. This said, the connection between the nerv- 
ous system and pigment-producing cells is probably conserved dur- 
ing evolution. Cephalopods like squid, octopus and cuttlefish have 
sophisticated colouration systems that allow them to change colour 
for camouflage or communication. Neuronal activities control their 
pigment-producing cells (chromatophores), allowing rapid changes 
in colour in response to predators or threats“. Therefore, an attractive 
hypothesis is that sympathetic nerves might modulate MeSC activity, 
melanocyte migration or pigment production in situations independent 
of the hair cycle—for example, under bright sunlight or UV irradiation®. 
Under extreme stress, however, hyperactivation of neuronal activities 
overstimulates the pathway, which drives the depletion of MeSCs. 


MeSCsalso exhibit ectopic differentiation and depletion withage©”°. 
Of relevance, patients who have undergone partial sympathectomy 
develop fewer numbers of unpigmented hairs on the sympathecto- 
mized side with age**””. In the future, it will be interesting to investigate 
whether the mechanisms that we have uncovered here also contribute 
to the loss of MeSCs that occurs during ageing, and whether stress 
might mimic an accelerated ageing process. 
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Methods 


Randomization 

The mice were randomly assigned into different experimental groups 
whenever possible, except in experiments that required specific geno- 
types (in this case, sex-matched littermate controls were used). 


Blinding 

For LC-MS/MS analysis, RNA-seq library preparation and sequencing, 
experimenters were blinded to experimental conditions. Blinding was 
not possible in mouse studies owing to the need to identify specific 
genotypes or treat mice with different chemicals according to experi- 
mental designs. 


Mice 

C57BL/6), Tyr?" KIS“??? Rag] mutant, CD11b-DTR, GR™, CAG-LSL-G,- 
DREADD, Rosa!?8 Fr/ePl-mcherry Rosq26™™™>, Rosa’ and Ripk3 mutant 
mice were obtained from the Jackson Laboratory. Adrb2“ mice“ were 
originally generated by G. Karsenty (Columbia University) and provided 
tous byP. Frenette (Albert Einstein College of Medicine). TH”“"” mice” 
were generated and provided by D. Ginty (Harvard Medical School). 
TetO’” mice” were originally generated by G. Cady (Roswell Park Cancer 
Institute) and provided to us by V. Greco (Yale School of Medicine). 
All experiments used balanced groups of male and female mice. All 
experiments were conducted using mice at the same stage of the hair 
cycle and inacomparable age range (postnatal day (P)20-P25 for first 
telogen, P31-P36 for full anagen and P50-P60 for second telogen, or 
long-term monitoring as specified). To monitor the hair cycle, mice 
were shaved at weaning to monitor changes in skin colour, and these 
were confirmed by skin sections. The acquisition of human melanocyte 
cells was carried out in accordance with the Institutional Review Board 
policies at Massachusetts General Hospital. All mice were maintained 
in an animal facility approved by the Association for Assessment and 
Accreditation of Laboratory Animal Care at Harvard University, Har- 
vard Medical School and Ribeirao Preto Medical School. Procedures 
were approved by the Institutional Animal Care and Use Committee 
of all institutions and complied with all relevant ethical regulations. 


Stress procedures 

Restraint and chronic unpredictable stress procedures were performed 
as previously described". In brief, for restraint stress, CS7BL/6J 
mice were kept in a restrainer (Thermo Fisher Scientific 12972590) 
for four hours per day for five days starting from mid-anagen (P28- 
P30). Hairs were depilated to induce hair regeneration when their hair 
cycle reached telogen. Mice were stressed and depilated four roundsin 
total to monitor long-term changes. For chronic unpredictable stress, 
C57BL/6) mice were exposed to a combination of stressors. Two of 
the stressors were applied each day. The stressors included cage tilt, 
isolation, damp bedding, rapid light and dark changes, overnight illu- 
mination, restraint, empty cage and changing the cage three times. All 
stressors were randomly repeated in consecutive weeks. 


Drug treatments 

For RTX treatment (see also Supplementary Discussion), mice received 
injections of RTX (30-100 pg kg”) in the flank for 1-3 days, as described 
previously>"**! °°, RTX was prepared in 2% DMSO with 0.15% Tween 80 in 
PBS. Control mice were treated with the vehicle only. RTX injection was 
done either in full anagen (P31-P36) or in first telogen (P21). For corti- 
costerone feeding, 35 tg mI corticosterone (Millipore Sigma, C2505) 
was dissolved in 0.45% hydroxypropyl-B-cyclodextrin and provided in 
the drinking water. Mice were treated for three days (P28-P30). Control 
mice received the vehicle water (0.45% B-cyclodextrin). For analgesia, 
mice were injected with buprenorphine (0.1 mg kg”) 4h before RTX 
injection and every 6 h after RTX injection for 2 days. For tamoxifen 
treatment, tamoxifen was diluted in corn oil to a final concentration 


of 20 mg mI”. To induce recombination, 20 mg kg” was injected 
intraperitoneally once per day for 4-7 days. For mosaic induction of 
Tyr-CreERT2 and TH-CreERT2, 20 mg kg! tamoxifen was injected 
intraperitoneally once per day for 3 days. For intradermal noradrenaline 
injection, noradrenaline (Sigma-Aldrich 489350) solution was prepared 
freshly by dissolving in 0.1% ascorbic acid in 0.9% sterile NaCl toa final 
concentration of 2 mM. Fifty microlitres was injected intradermally 
into experimental mice together with fluorescent beads at full anagen 
(around P31-P36). Control mice were injected with an equivalent vol- 
ume of vehicle (0.1% ascorbic acid in 0.9% sterile NaCl) with fluorescent 
beads. The injection sites were marked using water-resistant ink. For 
ablation of sympathetic nerves, 6-hydroxydopamine hydrobromide 
(6-OHDA, Sigma 162957) solution was prepared freshly by dissolving 
6-OHDA in 0.1% ascorbic acid in 0.9% sterile NaCl. Mice were injected 
intraperitoneally with 6-OHDA (100 mg per kg body weight) daily from 
P18 to P22. Control mice were injected with an equivalent volume of 
vehicle (0.1% ascorbic acid in 0.9% sterile NaCl). Ablation efficiency 
in the skin was confirmed by immunofluorescence staining. For gua- 
nethidine treatment, mice were intraperitoneally injected with 30 mg 
per kg body weight of guanethidine (Sigma-Aldrich, 1301801), once a 
day for 3 consecutive days before RTX administration at full anagen 
(around P31-P36). For induction of G,-DREADD, 50 pl CNO (1mg mI™ 
in 0.9% sterile saline) was injected intradermally into experimental 
mice, together with fluorescent beads at full anagen (around P31-P36). 
Control mice were injected with an equivalent volume of vehicle (0.9% 
sterile saline) together with fluorescent beads. For administration 
of diphtheria toxin, diphtheria toxin (Sigma-Aldrich) was dissolved 
in 0.9% saline (0.1 mg ml). For ablation, CD11b-DTR transgenic mice 
were intraperitoneally injected with diphtheria toxin (25 ng per g body 
weight) daily 3 days before RTX injection at full anagen (around P31- 
P36). Diphtheria toxin (20 ng per g body weight) was injected every 3 
days after RTX injection until collection of skin samples. For inhibitor 
treatment, mice were shaved and pre-treated with AT7519 (Cayman 
Chemical 16231) or flavopiridol (Cayman Chemical 10009197) (bothS 
mg per kg body weight) in ethanol topically 48 h and 24 h before RTX 
injection, at the time of RTX injection and 24 hand 48 h after injection. 
For induction of P27 expression, mice were fed with the doxycycline 
rodent diet (VWR 89067-462) for three days before RTX treatment 
and three days after. RTX was given at Anagen VI (around P31-P36). 


Quantification of unpigmented hairs 

For restraint and chronic unpredictable stress, unpigmented hairs were 
quantified by plucking around 100 hairs from 3 or 4 regions of the skin 
across the anterior to posterior end, and the percentage of white hairs 
was calculated by dividing the number of white hairs by the total num- 
ber of hairs plucked. For RTX injection experiments, the percentage 
of regions with white hairs was calculated by dividing the size of areas 
with white hairs by the size of the whole skin (both areas were measured 
using Image)). For intradermal injection experiments (noradrenaline 
or CNO), unpigmented hairs were quantified by plucking around 100 
hairs from each injection site (marked by water-resistant ink at the 
time of injection), and the percentage of white hairs was calculated by 
dividing the number of white hairs by the total number of hairs plucked. 


Histology and immunohistochemistry 

Mouse skin samples were fixed using 4% paraformaldehyde (PFA) for 
15 min at room temperature, washed 6 times with PBS and immersed in 
30% sucrose overnight at 4 °C. Samples were then embedded in optimal 
cutting temperature (OCT) compound (Sakura Finetek). Sections of 
approximately 35-50 um were fixed in 4% paraformaldehyde (PFA) for 2 
min and washed with PBS and 0.1% Triton X-100 in PBS. Slides were then 
blocked using blocking buffer (5% donkey serum; 1% BSA, 2% cold water 
fish gelatin in 0.3% Triton X-100 in PBS) for 1h at room temperature, 
followed by staining with primary antibodies overnight at 4 °C and 
secondary antibody for 4 h at room temperature. For quantification 


of the density of sympathetic nerves, 90-y1m sections were used. The 
TUNEL assay was performed according to the manufacturer’s instruc- 
tions (Roche). Fontana-Masson staining was performed according 
to the manufacturer’s instructions (Market Lab ML7255). Antibodies 
used: TRP2 (Santa Cruz 10451, 1:800), tyrosine hydroxylase (rabbit, 
Millipore AB152, 1:1,000 or sheep, Millipore AB1542, 1:150-1:300), 
FOS (Abcam, 1902839, 1:1,000), y-H2AX (Cell Signaling, 9718, 1:400), 
phosphorylated histone H3 (rabbit, Cell Signaling Technology 3377S, 
1:500), cleaved caspase 3 (rabbit, Cell Signaling Technology 9664S, 
1:400), GFP (rabbit, Abcam ab290, 1:1,000 or chicken, Aves labs GFP- 
1010, 1:200), CD3 (eBioscience 14-0032-81, 1:800), CD11B (eBioscience 
14-0112-81, 1:800), phosphorylated CREB (Cell Signaling 9198, 1:800) 
and MITF (Abcam ab120339, 1:400). 


Measurement of stress hormones 

Asample of 50 pl of blood plasma was collected from each mouse 
and transferred into a1.5-ml microcentrifuge tube. Ten microlitres of 
internal solution was added to each sample followed by 100 pl water 
and 640 ul methanol. Samples were incubated at —20 °C for 1h, then 
centrifuged for 30 min at maximum speed at —9 °C. The supernatant was 
transferred to anew tube and dried under N2 flow, then resuspended 
in 50 pl methanol and transferred to micro-inserts. All samples were 
runonan Agilent 6460 Triple Quad LC/MS with an Agilent 1290 Infinity 
HPLC. For corticosterone-treated mice, plasma corticosterone levels 
were determined by ELISA according to the manufacturer’s instructions 
(Arbor Assays, K014-H1). 


Radiation 

Ten-week-old C57BL/6J mice were gamma-irradiated (’Cs source) with 
a dose of 10.5 Gy. Mice were transplanted with 300,000 whole bone 
marrow cells to ensure survival after lethal irradiation. 


FACS 

Mouse dorsal skin was collected, and the fat layer was removed by 
gentle scraping from the dermal side. The skin was incubated in 0.25% 
collagenase in Hank’s balanced salt solution (HBSS) at 37 °C for 35-45 
min onan orbital shaker. Single-cell suspension was collected by gentle 
scraping of the dermal side and filtering through 70-m and 40-um 
filters. The epidermal layer was incubated in trypsin-EDTA at 37 °C for 
35-45 min on an orbital shaker. Single-cell suspension was collected 
by gentle scraping of the epidermal side and filtering through 70-um 
and 40-um filters. The single-cell suspension was centrifuged for 5 min 
at 4 °C, resuspended in 0.25% FBS in PBS and stained with fluorescent- 
dye-conjugated antibodies for 30 min. For late anagen skin samples, the 
bottom parts of the hair follicles, containing mature melanocytes, were 
removed by gentle scraping under a dissection microscope. The MeSCs 
located close to the bulge remained and were verified by immunostain- 
ing. Antibodies used: CD140A (Invitrogen 13-1401-82, 1:200), CD45 
(Invitrogen 13-0451-82, 1:400), SCA1 (Invitrogen 13-5981-82, 1:1,000), 
CD34 (Invitrogen 13-0341-82, 1:100), CD117 (Biolegend 135136, 1:400). 
See a published protocol for detailed instructions”. 


RNA isolation 

RNA was isolated using a RNeasy Micro Kit (Qiagen) with QlAcube 
according to the manufacturer’s instructions. RNA concentration 
and RNA integrity were determined by Bioanalyzer (Agilent) using the 
RNA 6000 Nano chip. High-quality RNA samples with an RNA integrity 
number of 8 or higher were used as input for RT-PCR and RNA-seq. 


Cell culture 

Primary human melanocytes were derived from neonatal foreskin as 
previously described* and cultured in Medium 254 (Invitrogen, Thermo 
Fisher Scientific). Melanocytes (passages 2 and 4) were starved for 
24 hin Ham’s F-10 nutrient mixture with 1% penicillin-streptomycin-— 
glutamine before the addition of noradrenaline (10 pM). 


qRT-PCR 

The cDNA libraries were synthesized using Superscript IV VILO master 
mix with ezDNase (Thermo Fisher Scientific). qPCR was performed 
using power SYBR green (Thermo Fisher Scientific) on an ABI QuantS- 
tudio6 Flex qPCR instrument. C, values were normalized to an internal 
control of B-actin (Actb). 


Imaging and image analysis 

Allimages were acquired using a Zeiss LSM 880 confocal microscope or 
Keyence microscope with x20 or x40 magnification lenses. Images are 
presented as maximum intensity projection images. For colocalization 
analysis, images are presented as a single z-stack. For quantification 
of the density of sympathetic nerves, tyrosine hydroxylase staining of 
sympathetic nerves was performed on samples of skin sections (90- 
pm thickness) to ensure the capture of all fibres innervating each hair 
follicle. Sympathetic nerves innervating individual hair follicles were 
selected and imaged using a Zeiss LSM 880 confocal microscope. Three- 
dimensional (3D) surfaces of tyrosine hydroxylase staining were created 
using Imaris x64 9.3.0 software and the volume was measured and 
compared. To quantify cell numbers (MeSC numbers, cell death events, 
proliferation events) within a hair follicle, immunofluorescence stain- 
ing images of skin sections from multiple regions across the body were 
used. The number of cells was counted manually or by using Image]. 


Statistical analysis 

Statistical analyses were performed with GraphPad Prism 7.00, using 
unpaired two-tailed Student’s t-tests, one-way or two-way ANOVA. All 
of the statistical tests performed are indicated in the figure legends. 
The data are presented as mean +s.d. No statistical methods were used 
to predetermine sample size. 


RNA-seq and computational analysis 

MeSCs were purified from skin samples from control and stressed 
mice at telogen using FACS, based on their expression of CD117’ and 
starting from a population that is negative for CD140A, CD45, SCA1 
and CD34”. A total of 2 ng of RNA from each sample was used to gen- 
erate RNA-seq libraries using a SMART-Seq v4 Ultra Low Input RNA kit 
(Takara, 634888) and Nextera XT DNA Library Preparation Kit (Illu- 
mina, FC-131-1024). Single-end sequencing reads were obtained using 
the Illumina NextSeq 500 platform. Sequencing reads from RNA-seq 
libraries were trimmed using Trim Galore! (https://www.bioinformat- 
ics.babraham.ac.uk/projects/trim_galore/) and aligned to the mouse 
reference genome (mm10) using STAR aligner’’. Reads with alignment 
quality <Q30 were discarded. Gene expression levels were normalized 
and differential expression of genes was calculated using the DESeq2 
package in R®. Gene set functional enrichment analysis was performed 
using DAVID®”. Transcripts per million (TPM) calculated from count 
tables of control MeSC samples were used to determine the expression 
levels of adrenergic receptors and glucocorticoid receptor shown in 
Extended Data Fig. 3c. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The sequencing data that support the findings of this study have been 
deposited in the Gene Expression Omnibus (GEO) with the accession 
code GSE131566. Source data for all main figures and Extended Data 
figures are provided with the paper. 
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Extended Data Fig. 1| Effects of stress on the hair pigmentation. a, Schematic 
of MeSC behaviour during the hair cycle. b, Hair greying after mice were 
subjected to chronic unpredictable stress. Quantifications were done by 
plucking around 100 hairs from different regions across the skin and counting 
the number of white hairs (n=9 plucked regions from 3 mice for each condition, 
two-tailed unpaired t-test). c, Hair greying after mice were subjected to 
restraint stress. Quantifications as described in b.d, LC-MS/MS quantification 
of corticosterone and noradrenaline after restraint stress (n=5 mice for 
control andn=6 mice for restraint, two-tailed unpaired f-test). e, Left, 
immunofluorescent staining of hair bulbs for melanocyte-inducing 
transcription factor (MITF, red) from mice five days after treatment with saline 
or RTX. Right, quantification of MITF’ cells (n=30 hair follicles from 3 mice for 
each condition, two-tailed unpaired t-test). f, Fontana-Masson staining of hair 
bulbs for melanin from mice five days after treatment with saline or RTX (n=6 
mice for each condition). g, Coat colour in mice five days after RTX injectionin 
anagen. RTX was injected in full anagen and the mice were examined five days 


later, at late anagen. The coat colour remained black (n= 6 mice for each 
condition). h, Fontana-Masson staining of hair follicles for melanin from mice 
that were treated with saline or RTX at first anagen and examined at second 
anagen (see Fig. le for corresponding fluorescent images) (n=6 mice for each 
condition). i, Quantification of the number of MeSCs inthe skin of mice 
injected with saline or RTX. For the RTX-injected mice, the numbers of MeSCs 
in regions with predominantly black hairs and regions with many white hairs 
are quantified separately. Orange and green dashed boxes denote 
representative black and white hair regions, respectively, in RT X-injected mice. 
Enlarged boxes show representative immunofluorescent images of hair 
follicles from each region. White arrowheads indicate regions where MeSCs 
reside (n=30 hair follicles from 3 mice for each condition, one-way ANOVA with 
Tukey’s multiple comparisons test).j, Quantification of the body area covered 
by white hairs in female versus male mice (n=5 mice for each sex, two-tailed 
unpaired ¢-test). Alldataare meant+s.d. 
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Extended Data Fig. 2| Loss of MeSCs in three different models of stress. 

a, Top, schematic of experimental design for RTX injection in first telogen (red 
arrows indicate collection of skinsamples). Bottom left, representative mouse 
images 5 days and 16 days after RTX injection in first telogen. Bottom right, 
quantification of the body area covered by white hairs 16 days after RTX 
injection (n=4 mice for each condition, two-tailed unpaired t-test). 

b, Immunofluorescent staining for TRP2 from mice injected with saline or 

RTX in first telogen (n= 30 hair follicles from 4 mice for each condition, two- 
tailed unpaired t-test). Yellow boxes denote the upper region of the hair follicle, 
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where MeSCs reside. Enlarged views of the yellow-boxed regions are shown to 
the right. White arrowheads indicate MeSCs. Quantifications of MeSCs are also 
shown (bottom). c, Immunofluorescent staining for TRP2 (red) (left) and 
quantification of MeSCs (right) from mice subjected to chronic unpredictable 
stress or restraint stress (n= 30 hair follicles from5 mice for each condition, 
two-tailed unpaired t-test). d, Hair coat colour was monitored in RTX-injected 
mice for multiple rounds of hair-follicle regeneration (waxing was used to 
initiate new rounds of anagen) (n=3 mice for each condition). Schematic 
denotes the experimental design. Scale bars, 50 pm. All dataare mean+s.d. 
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Extended Data Fig. 3 | Stress-induced hair greying is not mediated by 
corticosterone or immune attack. a, Left, formation of white hairs after RTX 
injectionin Ragi mutant mice that are devoid of T and B cells (Ragi KO) (n=6 for 
each condition, two-tailed unpaired t-test). Right, immunofluorescent staining 
for the T cell marker CD3 (green) in control and Ragi KO skin (n= 6 mice for each 
condition, two-tailed unpaired t-test). b, Left, hair greying occurs when RTX is 
injected into CD11b-DTR mice that were treated with diphtheria toxin to deplete 
myeloid cells (n= 6 mice for each condition). Right, immunofluorescent 
staining for CD11B (green) inskin from control and CD11b-DTR mice that were 
treated with diphtheria toxin (n= 6 mice for each condition). c, Expression of 
adrenergic receptors and glucocorticoid receptor (GR) in MeSCs (n=2 


biologically independent samples). d, Formation of white hairs after RTX 
injection into Ty"""”";GR/™ mice (MeSC-GR cKO) (n= 6 mice for each 
condition, two-tailed unpaired ¢-test). e, Left, enzyme-linked immunosorbent 
assay (ELISA) measurement of the level of corticosterone in the blood three 
days after supplying corticosterone in the drinking water (n=4 mice for each 
condition). Middle, immunofluorescent staining of hair follicles for TRP2 (red) 
from mice five days after treatment with corticosterone (n= 30 hair follicles 
from 3 mice for each condition, two-tailed unpaired t-test). Right, coat colour 
after hair follicles in corticosterone-treated mice enter another round of 
anagen to regenerate new hairs. Scale bars, 50 pm. All data are mean +s.d. 
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Extended Data Fig. 4 | Perturbations of the noradrenaline-ADRB2 pathway. 
a, Left, immunofluorescent staining of hair follicles for phosphorylated CREB 
(pCREB; green) and TRP2 (red) 12 hafter RTX injection. Right, quantification of 
the percentage of pCREB* MeSCs (n=30 hair follicles from 3 mice for each 
condition, two-tailed unpaired t-test). White arrowheads indicate pCREB” 
MeSCs in upper hair follicles after RTX injection. b, Formation of white hairs 
after RTX injection into KIS“; Adrb2" mice (HFSC-Adrb2 cKO) (n=3 mice for 
each condition, two-tailed unpaired ¢-test).c, Top left, coat colour in 
unstressed Tyr"*’?; Adrb2/' mice (MeSC-Adrb2 cKO) inthe second telogen 
after treatment with tamoxifen (seven times) at the first telogen. Bottom left, 
immunofluorescent staining of hair bulbs for MITF (red) in Ty""”2;Adrb2™" 
mice in anagen. Right, Fontana-Masson melanin staining of anagen hair follicles 
from Tyr"*”? -Adrb2™' mice (n=3 mice for eachcondition). d, Top left, 
schematic of experimental design for mosaic labelling in unstressed control 
and Adrb2 knockout mice (red arrows indicate collection of skin samples). 
Bottom left, immunofluorescent staining for GFP (green) and TRP2 (red) from 
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Tyr ®?2-Rosa™ © mice (MeSC-mT/mG) and Tyr"? -Adrb2";Rosa™* mice 
(MeSC-Adrb2 cKO-mT/mG) after treatment with tamoxifen (three times) at first 
telogen. Right, immunofluorescent staining of hair follicles for GFP (green) and 
TRP2 (red) after the mice enter anagen (n =3 mice for each condition). 

e, Quantification of the percentage of white hairs after intradermal injection of 
saline or noradrenaline (n=10 injected sites from 6-8 mice for each condition, 
one-way ANOVA with Tukey’s multiple comparisons test). f, Left, 
immunofluorescent staining of hair follicles for TRP2 (red) from the skins of 
mice that were intradermally injected with noradrenaline (n=30 hair follicles 
from 10 injection sites for each condition, one-way ANOVA with Tukey’s 
multiple comparisons test). Right, quantification of MeSCs. g, White hairs are 
formed after intradermal injection of noradrenaline in KIS“; Adrb2™ mice 
(HF-Adrb2 cKO) (n= 3 injection sites for each condition, two-tailed unpaired 
t-test). Yellow dashed circles denote intradermal injection sites. Scale bars, 

50 um. All dataare mean+s.d. 
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Extended Data Fig. 5| See next page for caption. 
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Extended Data Fig. 5 | Activation of the sympathetic nervous system by 
nociception-induced stress or sympathetic-nerve-specific inducible G,- 
DREADD. a, LC-MS/MS quantification of stress hormones in sham-operated 
and adrenalectomized mice (n=3 mice for each condition, two-way ANOVA 
with Benjamini-Hochberg correction). b, Top, immunofluorescent staining of 
sympathetic nerves inthe skin regions with predominantly black hairs (orange 
box) and with mostly white hairs (green box) (n=3 mice for each condition). 
Bottom, 3D surfaces of tyrosine hydroxylase (TH) staining, created using 
Imaris software, and quantification of the sympathetic nerve volume 

from regions with different numbers of unpigmented hairs (n= 20 hair 
follicles for each region from 3 mice, two-tailed unpaired ¢-test).c, Left, 
immunofluorescent staining of sympathetic ganglia for tyrosine hydroxylase 
(green) and FOS (red) from mice injected with RTX. Cells were collected at 
different time points between O and 24 h. Right, quantification of FOS" cells 
(n=6 sympathetic ganglia from 3 mice for each time point). d, Quantification of 
the efficiency of chemical sympathectomy (n=6 mice for each condition, two- 
tailed unpaired t-test), and percentage of white hairs in RTX-injected mice 


treated with vehicle (ctrl) or 6-OHDA (SN abla) (n=6 mice for each condition, 
two-tailed unpaired t-test). e, Guanethidine (gua) injection blocks the 
formation of white hairs that is induced by RTX injection (quantification for 
percentage of white hairs: n=14 mice for each condition, two-tailed unpaired 
t-test; quantification for numbers of MeSCs: n= 30 hair follicles from 6 mice for 
each condition, two-tailed unpaired t-test). f, Left, immunofluorescent staining 
of sympathetic ganglia for tyrosine hydroxylase (green) and FOS (red) from 

TH ®!2°CAG-LSL-G,-DREADD mice that were injected with CNO; cells were 
collected 6h later. Right, quantification of FOS* cells (n=6 sympathetic ganglia 
from 2 mice for each condition, two-tailed unpaired t-test). g, Formation of 
white hairs after intradermal injection of CNO into TH"; CAG-LSL-G,- 
DREADD mice (n= 6 injection sites from 5 mice for each condition, two-tailed 
unpaired t-test). Yellow dashed circles denote intradermal CNO injection sites. 
h, Quantification of the percentage of white hairs at CNO injection sites in 
mosaically induced TH’“*”?;CAG-LSL-G,-DREADD;Rosa™”“ mice (n=5 injection 
sites from 4 mice for each condition, two-tailed unpaired t-test). Scale bars, 
50m. All dataare mean+s.d. 
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Extended Data Fig. 6 | See next page for caption. 
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Extended Data Fig. 6 | Analysis of apoptosis and proliferation of MeSCs and 
the effect of RTX or noradrenaline on mature melanocytes. a, Left, 
immunofluorescent staining of active caspase-3 (aCAS3; green) and TRP2 (red) 
from mice one day after injection with RTX or noradrenaline. Right, 
quantification of aCAS3* MeSCs (n= 30 hair follicles from 6 mice for each 
condition, one-way ANOVA with Tukey’s multiple comparisons test). b, TUNEL 
assay of hair follicles from mice one day after treatment with RT X or 
noradrenaline. Catagen hair follicles were used as positive controls for TUNEL. 
White arrowhead points to apoptotic hair follicle cells (n= 30 hair follicles from 
6 mice for each condition, one-way ANOVA with Tukey’s multiple comparisons 
test). c, Formation of white hairs in Ripk3 mutant mice (RIPK3 KO) injected with 
RTX (n=5 mice for each condition, two-tailed unpaired t-test). d, Left, 
immunofluorescent staining of hair follicles for the DNA damage marker 
y-H2AX (green) and TRP2 (red) from mice one day after treatment with RTX or 
noradrenaline. Hair follicles from irradiated mice were used as positive 
controls. White arrowhead indicates the MeSCs with DNA damage. Right, 
quantification of y-H2AX* MeSCs (n= 30 hair follicles from 6 mice for each 
condition, one-way ANOVA with Tukey’s multiple comparisons test). e, Left, 


immunofluorescent staining for phosphorylated histone H3 (pHH3; green) and 
TRP2 (red) of control hair follicles at different stages of the hair cycle. Right, 
quantification of pHH3* MeSCs (n= 25 hair follicles from 3 mice for each 
condition, one-way ANOVA with Tukey’s multiple comparisons test). f, Left, 
immunofluorescent staining of hair bulbs for aCAS3 (green) and TRP2 (red) 
from mice one day after injection with RTX or noradrenaline. Right, 
quantification of aCAS3* mature melanocytes (MCs) (n=30 hair follicles from3 
mice for each condition, one-way ANOVA with Tukey’s multiple comparisons 
test). g, Left, immunofluorescent staining of hair bulbs for pHH3 (green) and 
TRP2 (red) from mice one day after injection with RTX or noradrenaline. Right, 
quantification of pHH3* mature melanocytes (n= 30 hair follicles from 3 mice 
for each condition, one-way ANOVA with Tukey’s multiple comparisons test). 
h, Left, schematic of the strategy for isolation of MeSCs. Right, FACS analysis of 
the numbers of MeSCs one day after RTX injection (n=5 mice for each 
condition, two-tailed unpaired ¢-test). i, Fontana-Masson melanin staining of 
anagen or telogen samples five days after injection of saline or RTX (n=6 mice 
for each condition, two-tailed unpaired t-test). Blue arrowheads indicate 
ectopic pigments. Scale bars, 50 pm. All dataare mean+s.d. 
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Extended Data Fig. 7 | Differential gene expression in normal and stressed 
MeSCs. a, FACS strategy for the purification of MeSCs. MeSCs were selected on 
the basis of their expression of CD117, froma population that is negative for 
CD140A, CD45, SCA1and CD34 and that shows modest expression for integrin 
«6. b, Sample clustering based on Pearson's correlation of transcriptomes 
between control and stressed MeSCs (n=2 biologically independent samples 
for each condition). c, Heat map ofall differentially expressed genes (n=2 
biologically independent samples for each condition; Pvalues were calculated 
using the Wald test implemented in DESeq2, and adjusted using the Benjamini- 
Hochberg method. Differentially expressed genes were those that had the 


absolute value of log,(gene expression in stressed versus control MeSCs) 

> 0.58 and adjusted Pvalue < 0.05. d, Expression levels of marker genes for 
different cell types in the skin, confirming the purity of MeSCs that were used 
for RNA-seq (n=4 biologically independent samples). TPM, transcripts per 
million. e, Heat maps showing expression of signature genes that are related to 
the differentiation of MeSCs. f, Heat maps showing expression of cell-cycle 
signature genes. Hn1is also knownas/Jptl. g, (RT-PCR validation of selected 
differentially expressed genes in FACS-purified mouse MeSCs from skins of 
control and RTX-injected mice (n=4 biological replicates for each condition, 
two-way ANOVA with Benjamini-Hochberg correction). Alldataare mean+s.d. 
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Extended Data Fig. 8 | Proliferation analysis of RTX-injected mice that were RTX injection with MeSC-specific overexpression of P27 (MeSC-P27 OE). Right, 


treated with CDK inhibitors chemically or genetically. a, b, Left, quantification of pHH3* cells (n=30 hair follicles from 3 mice for each 
immunofluorescent staining of upper hair follicles (a) and hair bulbs (b) for condition, one-way ANOVA with Tukey’s multiple comparisons test). Scale bars, 
pHH3 (green) and TRP2 (red) from mice one day after RTX injection, RTX 50pm. All dataare mean+s.d. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


[| A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


Oo For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection BD FACSDiva (8.0.2), ZEN 2.3 SP1 (black 64 bit) 
Data analysis ImageJ (1.52h), Imaris (x64 9.3.0), Prism (7.00), Trim Galore! (0.4.1), STAR (2.5.3), DESeq2 (1.22.2), FlowJo (v10.0.7), DAVID (6.8), RStudio 
(Version 1.1.453), R (3.5.1) 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 


All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


Sequence data that support the findings of this study have been deposited in GEO with the accession codes GSE131566 (https://www.ncbi.nlm.nih.gov/geo/query/ 
acc.cgi?acc=GSE131566). 


Source data for all figures are provided with the paper. 
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Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


x] Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Sample sizes were determined based on prior studies and literatures of the field using similar experimental paradigms. 
Data exclusions No data were excluded from analysis 


Replication For in vivo experiments using rodent, biological replicates and independent cohorts of mice were used; for in vitro experiments, biological 
replicates as well as technical triplicates were used to ensure reproducibility. All attempts at replication were successful. 


Randomization — The mice were randomly assigned into different experimental groups whenever possible, except in experiments required specific genotypes. 


Blinding For LC-MS-MS analysis, RNA-seq library preparation, and sequencing, experimenters were blinded to experimental conditions. Blinding was 
not possible in mouse studies due to the need to identify specific genotypes or treat mice with different chemicals according to experimental 
designs. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used TRP2 (Santa Cruz, CAT 10451, Clone D18, Lot k2114, Dilution 1:400) 
Tyrosine hydroxylase (Millipore, CAT AB152, Clone N/A, Lot 3072361, Dilution 1:1000) 
Tyrosine hydroxylase (Millipore, CAT AB1542, Clone N/A, Lot 3168710, Dilution 1:150-1:300) 
c-Fos (Abcam, CAT 190289, Clone N/A, Lot GR3253255-1, Dilution 1:400) 
y-H2AX (Cell Signaling Technology, CAT 9718, Clone 20E3, Lot 17, Dilution 1:400) 

Phospho-Histone H3 (Cell Signaling Technology, CAT 3377S, Clone D2C8, Lot 7, Dilution 1:250) 

Cleaved Caspase-3 (Cell Signaling Technology, CAT 9664S, Clone 5A1E, Lot 45, Dilution 1:300) 

GFP (Abcam, CAT ab290, Clone N/A, Lot N/A, Dilution 1:800) 

CD3 (eBioscience, CAT 14-0032-81, Clone 17A2, Lot 4342633, Dilution 1:800) 

Cc 

P 


D11b (eBioscience, CAT 14-0112-81, Clone M1/70, Lot E03529-1632, Dilution 1:400) 
hospho-CREB (Cell Signaling Technology, CAT 9198, Clone 87G3, Lot 14, Dilution 1:800) 
ITF (Abcam, CAT ab12039, Clone C5, Lot GR3206967-1, Dilution 1:400) 

CD140a (Invitrogen, CAT 13-1401-82, Clone APAS, Lot 1993111, Dilution 1:200) 

CD45 (Invitrogen, CAT 13-0451-82, Clone 30-F11, Lot 1989134, Dilution 1:400) 

Scal (Invitrogen, CAT 13-5981-82, Clone D7, Lot 4346548, Dilution 1:1000) 

CD34 (Invitrogen, CAT 13-0341-82, Clone RAM34, Lot 1993111, Dilution 1:100) 

CD117 (Biolegend, CAT 135136, Clone ACK2, Lot B251908, Dilution 1:400) 


Validation TRP2 (Santa Cruz, CAT 10451): Mouse, Immunohistochemistry, validated on manufactures's website: http:// 
www.emdmillipore.com/US/en/product/Anti-Tyrosine-Hydroxylase-Antibody, MM_NF-AB152#overview 


Tyrosine hydroxylase (Millipore, CAT AB152): Mouse, Immunohistochemistry, validated on manufactures's website: http:// 
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www.emdmillipore.com/US/en/product/Anti-Tyrosine-Hydroxylase-Antibody, MM_NF-AB152#overview 


Tyrosine hydroxylase (Millipore, CAT AB1542): Mouse, Immunohistochemistry, validated on manufactures's website:http:// 
www.emdmillipore.com/US/en/product/Anti-Tyrosine-Hydroxylase-Antibody, MM_NF-AB1542#overview 


c-Fos (Abcam, CAT 190289, Clone N/A): Mouse, Immunohistochemistry, validated on manufactures's website: https:// 
www.abcam.com/c-fos-antibody-ab190289.html#top-500 


y-H2AxX (Cell Signaling Technology, CAT 9718): Mouse, Immunohistochemistry, validated on manufactures's website: https:// 
www.cellsignal.com/products/primary-antibodies/phospho-histone-h2a-x-ser139-20e3-rabbit-mab/9718 


Phospho-Histone H3 (Cell Signaling Technology, CAT 3377S): Mouse, Immunohistochemistry, validated on manufactures's 
ebsite: https://www.cellsignal.com/products/primary-antibodies/phospho-histone-h3-ser10-d2c8-xp-rabbit-mab/3377 


= 


Cleaved Caspase-3 (Cell Signaling Technology, CAT 9664S): Mouse, Immunohistochemistry, validated on manufactures's website: 
https://www.cellsignal.com/products/primary-antibodies/cleaved-caspase-3-asp175-5a1e-rabbit-mab/9664 


GFP (Abcam, CAT ab290, Clone N/A): Mouse, Immunohistochemistry, validated on manufactures's website: https:// 
www.abcam.com/gfp-antibody-chip-grade-ab290.html?productWallTab=ShowAll 
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CD3 (eBioscience, CAT 14-0032-81, Clone 17A2): Mouse, Immunohistochemistry, validated on manufactures's website: https:// 
ww.thermofisher.com/antibody/product/CD3-Antibody-clone-17A2-Monoclonal/14-0032-82 


s 


CD11b (eBioscience, CAT 14-0112-81, Clone M1/70): Mouse, Immunohistochemistry, validated on manufactures's website: 
https://www.thermofisher.com/antibody/product/CD11b-Antibody-clone-M1-70-Monoclonal/14-0112-82 


Phospho-CREB (Cell Signaling Technology, CAT 9198): Mouse, Immunohistochemistry, validated on manufactures's website: 
https://www.cellsignal.com/products/primary-antibodies/phospho-creb-ser133-87g3-rabbit-mab/9198 


ITF (Abcam, CAT ab12039, Clone C5): Mouse, Immunohistochemistry, validated on manufactures's website: https:// 
www.abcam.com/mitf-antibody-c5-chip-grade-ab12039.html 


CD140a (Invitrogen, CAT 13-1401-82, Clone APAS): Mouse, FACS, validated on manufactures's website: https:// 
www.thermofisher.com/antibody/product/CD140a-PDGFRA-Antibody-clone-APA5-Monoclonal/13-1401-82 


CD45 (Invitrogen, CAT 13-0451-82, Clone 30-F11): Mouse, FACS, validated on manufactures's website: https:// 
www.thermofisher.com/antibody/product/CD45-Antibody-clone-30-F11-Monoclonal/13-0451-82 


Scal (Invitrogen, CAT 13-5981-82, Clone D7): Mouse, FACS, validated on manufactures's website: https:// 
www.thermofisher.com/antibody/product/Ly-6A-E-Sca-1-Antibody-clone-D7-Monoclonal/13-5981-82 


CD34 (Invitrogen, CAT 13-0341-82, Clone RAM34): Mouse, FACS, validated on manufactures's website: https:// 
www.thermofisher.com/antibody/product/CD34-Antibody-clone-RAM34-Monoclonal/13-0341-82 


CD117 (Biolegend, CAT 135136, Clone ACK2): Mouse, FACS, validated on manufactures's website: https://www.biolegend.com/ 
en-us/products/apccyanine7-anti-mouse-cd117-c-kit-antibody-13795 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals C57BL/6J, Tyr-CreER, K15-CrePGR, Rag1 KO, CD11b-DTR, GR flox, Dreadd flox, Rosa-H2BGFP/mCherry, Rosa26-rtTA, Rosa-mTmG, 
RIPK3 knockout, Adrb2 flox, TH-CreER, pTRE-P27. All experiments used balanced groups of male and female mice. All 
experiments are conducted and compared using mice of the same hair cycle stage in comparable age range (P20-P25 for 1st 
telogen, P31-P36 for full anagen, and P50-P60 for 2nd telogen, or long-term monitoring as specified). 


Wild animals This study did not involve wild animals. 
Field-collected samples This study did not involve field-collected samples. 
Ethics oversight All animals were maintained in an Association for Assessment and Accreditation of Laboratory Animal Care-approved animal 


facility at Harvard University, Harvard Medical School, and Ribeirao Preto Medical School. Procedures were approved by the 
Institutional Animal Care and Use Committee of all institutions and were in compliance with all relevant ethical regulations. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 


Sample preparation Mouse dorsal skin was collected, and the fat layer was removed by gentle scrapping from the dermal side. The skin was 
incubated in 0.25% collagenase in HBSS at 37 °C for 35-45 minutes on an orbital shaker. Single cell suspension was collected by 
gentle scraping of the dermal side and filtering through 70 um and 40 um filters. The epidermal layer was incubated in trypsin- 
EDTA at 37 °C for 35-45 minutes on an orbital shaker. Single cell suspension was collected by gentle scraping of the epidermal 
side and filtering through 70 um and 40 um filters. The single cell suspension was centrifuged for 5 minutes at 4°C, resuspended 
in 0.25% FBS in PBS, and stained with fluorescent dye-conjugated antibodies for 30 minutes. For late anagen skin samples, the 
bottom parts of the hair follicles containing mature melanocytes were removed by gentle scrapping under dissection 
microscope. The MeSCs located close to the bulge remained and were verified by immunostaining. 


Si 
fev) 
a 
S 
= 
o 
= 
o 
Nn 
o 
fev) 
= 
a 
= 
= 
io) 
©) 
oO 
at 
= 
a 
Nn 
S 
=} 
= 
fev) 
5 
< 


Instrument FACS: BD FACSAria III 
Software BD FACSDiva (8.0.2), FlowJo (v10.0.7) 


Cell population abundance __ For sorting for RNA-seq, at least 10,000 MeSCs were collected. The purity was verified by expression of GFP driven by lineage 
specific CreER. 


Gating strategy Positive and negative gates were set using fluorescence minus one (FMO) background intensity controls. Fluorophores were 
chosen to minimize spectral overlap. 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
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Mycobacterium tuberculosis is an intracellular pathogen that uses several strategies to 


interfere with the signalling functions of host immune molecules. Many other 
bacterial pathogens exploit the host ubiquitination system to promote 
pathogenesis’”, but whether this same system modulates the ubiquitination of 

M. tuberculosis proteins is unknown. Here we report that the host E3 ubiquitin ligase 
ANAPC2~—a core subunit of the anaphase-promoting complex/cyclosome—interacts 
with the mycobacterial protein RvO222 and promotes the attachment of lysine-11- 
linked ubiquitin chains to lysine 76 of RVO222 in order to suppress the expression of 
proinflammatory cytokines. Inhibition of ANAPC2 by specific short hairpin RNA 
abolishes the inhibitory effect of RVO0222 on proinflammatory responses. Moreover, 
mutation of the ubiquitination site on RVO222 impairs the inhibition of 
proinflammatory cytokines by RvO222 and reduces virulence during infection in 
mice. Mechanistically, lysine-11-linked ubiquitination of RVO222 by ANAPC2 facilitates 
the recruitment of the protein tyrosine phosphatase SHP1 to the adaptor protein 
TRAF6, preventing the lysine-63-linked ubiquitination and activation of TRAF6. Our 
findings identify a previously unrecognized mechanism that M. tuberculosis uses to 
suppress host immunity, and provide insights relevant to the development of 
effective immunomodulators that target M. tuberculosis. 


To identify M. tuberculosis proteins that inhibit host inflammatory 
responses, we examined the effects of 208 mycobacterial secreted 
proteins and lipoproteins on the activation of NF-KB’, a transcription 
factor that is involved in numerous cellular processes, including inflam- 
mation. Of these mycobacterial proteins, RV0222—a serodiagnostic 
target for tuberculosis*—inhibited NF-KB activation. Consistent with 
this, transient expression of RvO222 in HEK293T cells markedly inhib- 
ited the activation of both NF-KB and AP-1, another transcription factor 
(Extended Data Fig. 1a, b). Transfection with RvO222 also resulted in 
reduced activation of NF-kB signalling in immortalized bone-marrow- 
derived macrophages (iBMDMs) infected with M. tuberculosis, again sug- 
gesting that RVO222 inhibits NF-KB activation (Extended Data Fig. 1c). 


Rv0222 inhibits host immunity 


To further investigate the effect of RVO222 on host immune responses, 
we generated a RvO222-deletion mutant of the M. tuberculosis strain 
H37Rv (H37RVARVO222) (Extended Data Fig. 1d). Primary peritoneal 
macrophages—which serve as both a habitat for, and first line of defence 
against, M. tuberculosis—were infected with H37Rv or H37RVARVO222 
before gene-expression profiling. Macrophages infected with the 


H37RvARVO222 strain had much higher mRNA levels of proinflamma- 
tory cytokines, including interleukin (IL)-1b, IL-6 and IL-12, than those 
infected with wild-type H37Rv (Fig. 1a and Extended Data Fig. le-g), but 
there was no significant difference in cell death (P= 0.76 when infected 
for 12 hand P=0.91 when infected for 24 h) and bacterial entry (P=0.67) 
(Extended Data Fig. 1h, i). Consistent with this finding, the lung tissue 
of mice infected with H37RVARvO222 had much higher levels of mRNAs 
encoding IL-1b, IL-6 and the IL-12 p40 subunit than did the lung tissue 
of mice infected with H37Rv (Fig. 1b-d). Furthermore, complementa- 
tion of H37RvARVO222 with RvO222, but not with green-fluorescent 
protein (GFP), restored the inhibition of //1b, //6 and Il12 p40 expres- 
sion upon infection with H37Rv (Fig. le-g and Extended Data Fig. 1d). 
Together, these results provide further evidence that RvO222 inhibits 
M. tuberculosis-triggered inflammatory responses. 

We next investigated the functional relevance of RvO222 in the patho- 
genesis of M. tuberculosis infection. The lung tissues of CS7BL/6 mice 
infected with H37Rv(ARvO222 + GFP) had less immune-cell infiltra- 
tion and fewer inflammatory lesions than those infected with H37Rv 
(Fig. 1h). AS shown above, H37RvARvO222 induced a higher level 
of /[12 p40 expression than did wild-type H37Rv (Fig. 1d). IL-12 p40 
has been reported to increase the differentiation of T-helper-1 (T,,1) 
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Fig. 1|Rv0222 inhibits host inflammatory responses. a, Heat map of 
downregulated (blue) and upregulated (red) mRNAs from peritoneal 
macrophages infected with H37Rv (n= 2) versus ARVO222 (n=2) (multiplicity of 
infection (MOI) =5) for 4h. Numbers 1and 2 refer totwo independent 
experimental analyses of log2(ARvO222/H37Rv). b-d, Quantitative 
polymerase chain reaction (qPCR) analysis of /l1b (b), 16 (c) and /112 p40 (d) 
mRNA from the lungs of C57BL/6 mice aerosol-infected with roughly 200 
colony-forming units (CFUs) per mouse of H37Rv or ARvO222 for 7 days. The 
graphs show cumulative data from three independent experiments 

(mean +s.e.m. of n=18), with red, blue and yellow circles denoting separate 
experiments. e-g, qPCR analysis of //1b (e), //6 (f) and /112p40 (g) mRNA 
from peritoneal macrophages infected with H37Rv, H37Rv + GFP, 
H37Rv(ARvO222 + GFP) or H37RV(ARVO222 + RvO222) for3hor6h(MOI=5) 


cells during M. tuberculosis infection®; T,,1 cells produce interferon 
(IFN)y; and IFNy inhibits the accumulation of pathogenic neutro- 
phils in infected lungs during M. tuberculosis infection®’. Hence, the 
reduced immune-cell infiltration that we observed in the lungs of 
H37RvARvO222-infected mice could be due to a higher level of IFNy- 
producing T,,1 cells. The bacterial burden in the lung tissue of mice 
infected with H37Rv(ARvO222 + RvO222) was much higher thanin mice 
infected with H37Rv(ARvO222 + GFP) (Fig. liand Extended Data Fig. 1j). 
Together, these results suggest that RvO222 is an essential virulence 
factor for M. tuberculosis. 


1 7 
Time (days) 


30 


(mean + s.e.m.). Data are representative of one experiment with at least three 
independent biological replicates; each circle represents one technical 
repeat. Bar charts show means. h, i, CS57BL/6 mice were aerosol-infected 

with roughly 200 CFUs per mouse of H37Rv, H37Rv(ARvO222 + GFP) or 
H37Rv(ARVO222 + RvO222). Histopathology was assessed in lung sections 
stained with haematoxylin and eosin from mice infected for 30 days 

(h; representative of one experiment with at least three independent 
replicates; scale bars, 1,000 pm (top) and 200 pm (bottom)), and bacterial 
titres were assessed in lungs from mice infected for 1, 7 or 30 days (i; cumulative 
data from three independent experiments; mean +s.e.m. of n=9 miceinfected 
for 1day and n=18 mice infected for 7 or 30 days). Two-tailed 

unpaired Student’s t-test (b-g) and two-sided Mann-Whitney U-test (i) were 
used for statistical analysis. For gel source data, see Supplementary Fig. 1. 


Rv0222 inhibits TRAF6 


Various pattern-recognition receptors (PRRs) in hosts regulate M. tuber- 
culosis-induced cytokine expression by activating mitogen-activated 
protein (MAP) kinase and NF-kB pathways. We found that, inthe absence 
of RvO222, macrophages infected with H37Rv exhibited much stronger 
activation of MAP kinase and NF-KB (Extended Data Fig. 2a), suggesting 
that RvO222 may inhibit the activation of pathways that comprise the p38 
MAP kinases, the c-Jun amino-terminal kinases (JNKs, which are down- 
stream of MAP kinases) or NF-kKBin response to M. tuberculosis infection. 
Inhibition of p38, JNK or NF-kB attenuated the enhanced expression levels 
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Fig. 2| RvO222 interacts with and inhibits TRAF6 signalling. 

a, b, Immunoblot (IB) andimmunoprecipitation (IP) of lysates of HEK293T cells 
transfected with various plasmids as indicated. c, Endogenous interaction of 
Rv0222 with TRAF6 in H37Rv(ARVO222 + RvO222)-infected peritoneal 
macrophages (MOI=S5).d, Left, images show confocal microscopy of peritoneal 
macrophages infected with H37Rv(ARv0222 + RvO222) for 3h (MOI=5). Scale 
bar, 5 um. Right, the co-localization ratio from 60 macrophages (mean+s.e.m.). 
e, Peritoneal macrophages were infected with H37Rv or ARvO222 for the 


of /[1b and //6 in macrophages infected with H37RVARVO222 (Extended 
Data Fig. 2b, c), suggesting that Rv0222 may inhibit the M. tuberculosis- 
induced expression of proinflammatory cytokines by downregulating 
the activation of the p38, JNK and NF-KB pathways. 

M. tuberculosis infection induces the expression of proinflamma- 
tory cytokines via the Toll-like receptor (TLR) signalling pathway®’, 
whichis composed of several adaptor proteins, namely tumour necro- 
sis factor (TNF)-receptor-associated factor 6 (TRAF6), transforming 
growth factor-B (TGFB)-activated kinase 1 (TAK1) and TAK1-binding 
protein 1 (TAB1)"°. We found that TRAF6 interacts with RvO222 (Fig. 2a, 
b and Extended Data Fig. 2d). Moreover, through co-immunoprecip- 
itation and confocal microscopy analysis of macrophages infected 
with H37Rv(ARvO222 + RvO222), we observed an endogenous inter- 
action between Rv0222 and TRAF6 (Fig. 2c, d). Among TRAF-family 
proteins, RvO222 associates with TRAF1 and TRAF6 (Extended Data 
Fig. 2d). RvO222 efficiently blocked the TRAF6-triggered, but not the 
TAK1-plus-TAB1-triggered, activation of NF-KB and AP-1 (Extended Data 
Fig. 2e-h). Knockdown of TRAF6 by specific shRNA markedly attenu- 
ated the inhibitory effect of RVO222 on the expression of /l1b, /l6 and 
1112 p40 (Extended Data Fig. 2i-k). Consistent with this, knockdown of 
TAK1iniBMDMs by specific shRNA also markedly reduced the RvO0222- 
mediated inhibitory effect on the expression of /11b, /I6 and Il12 p40 
(Extended Data Fig. 2l-n). However, TRAF1 had no marked effect on 
the activation of the NF-KB or MAP kinase signalling pathways, or on 
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indicated times (MOI=5); cell lysates were immunoprecipitated with an anti- 
TRAF6 antibody; and immunoprecipitates were analysed with an anti-K63-Ub 
antibody. f, g, Immunoblot and immunoprecipitation of lysates from HEK293T 
cells transfected with plasmids as indicated. Dataina-g represent one 
experiment with at least three independent biological replicates. For gel 
source data, see Supplementary Fig. 1. 


cytokine expression in M. tuberculosis-infected macrophages (Extended 
Data Fig. 20-r). Together, these data suggest that RVO222 may inhibit 
the activation of host MAP kinase and NF-kB signalling by targeting 
the TRAF6 pathway. 

Activation of TRAF6 is a ubiquitination-dependent process’. We 
found that overexpression of Rv0222 markedly inhibited both total 
ubiquitination and K63-linked ubiquitination of TRAF6 in HEK293T 
cells (Extended Data Fig. 2s, t), whereas deletion of RV0222 markedly 
enhanced K63-linked ubiquitination of TRAF6 in macrophages infected 
with H37Rv (Fig. 2e). The protein tyrosine phosphatases SHP1 and SHP2 
(hereafter SHP1/2) have been reported to interact with TRAF6 and inhibit 
its ubiquitination”. Therefore, we next examined the effect of RVO222 on 
the interaction between SHP1/2 and TRAF6. In HEK293T cells, RvO222 
promoted the binding of SHP1/2 with TRAF6 (Fig. 2f, g). We generated 
iBMDMs in which Shp] and Shp2were stably knocked down (Extended 
Data Fig. 2u), and found that silencing of Shp1/2by specific shRNA elimi- 
nated the inhibitory effect of RVO222 on cytokine expression (Extended 
Data Fig. 2v-y). Together, these data suggest that RvO222 facilitates the 
association of SHP1/2 with TRAF6, thus inhibiting TRAF6 activation. 


K11-linked Rv0222 ubiquitination by ANAPC2 


Given that TRAF6 serves as a classical E3 ligase for the K63-linked 
polyubiquitination of multiple signalling molecules”, and that RvO222 
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Fig. 3|RVv0O222is K11-ubiquitinated at K76 by host ANAPC2. a, Immunoblot and 
immunoprecipitation of lysates from peritoneal macrophages infected with 
H37Rv(ARVO222 + RvO222) for the indicated times (MOI=5).b, Immunoblot 
andimmunoprecipitation of lysates of HEK293T cells transfected with 

plasmids encoding Flag—Rv0222 and Myc-ANAPC2.c, Immunoblot and 
immunoprecipitation of lysates of HEK293T cells transfected with plasmids 
encoding HA-tagged K11-linked ubiquitin (K11-Ub), Flag-tagged RvO222 and Myc- 
tagged ANAPC2.d, The E3 ligase APC/C produces K11-linked ubiquitin chains on 
Rv0222in vitro. e, Locations of lysine residues (red) onthe RvO222 structure 


interacts with TRAF6, we next examined whether Rv0222 is ubiqui- 
tinated in host cells. We first detected the possible polyubiquitina- 
tion of Rv0222 in HEK293T cells (Extended Data Fig. 3a). Seven lysine 
residues in ubiquitin—K6, K11, K27, K29, K33, K48 and K63—have been 
reported to form polyubiquitin chains® ”. We therefore co-trans- 
fected haemagglutinin (HA)-tagged wild-type ubiquitin (containing 
all seven lysine residues) or sequential ubiquitin substitution mutants 
(with only one of the seven lysine residues retained as lysine, and 
the other six replaced with arginine)with Flag-tagged RvO222 into 
HEK293T cells. A co-immunoprecipitation assay showed that the 
K11 ubiquitin mutant markedly increased the polyubiquitination 
of RvO222 (Extended Data Fig. 3b). In addition, we detected endog- 
enous K11-linked polyubiquitin conjugates on RvO222 in macrophages 
infected with H37Rv(ARvO222 + RvO222) (Fig. 3a and Extended Data 
Fig. 3c). Thus, RVO222 may undergo K11-linked polyubiquitination 
inhost cells. 

We therefore hypothesized that a host factor is likely to be involved 
inits polyubiquitination. ANAPC2, BIRC2 and RNF7 are three known 
host E3 ligases with K11-linked polyubiquitination activity’. Co- 
immunoprecipitation showed that ANAPC2 interacted with RvO222 
(Fig. 3b and Extended Data Fig. 3d), and markedly increased the K11- 
linked polyubiquitination of RVO222 (Fig. 3c). We further performed 
biochemical confirmation of ANAPC2-mediated ubiquitination of 
Rv0222 in vitro (as described previously”), and found that K11- 
linked polyubiquitination conjugates of RvO222 were formed only 
in the presence of ANAPC2 (Fig. 3d), suggesting an essential role of 
this ligase in mediating K11-linked polyubiquitination of RvO222 
directly. It has been reported that the anaphase-promoting com- 
plex/cyclosome (APC/C) assembles K11/K48-branched chains". 
Using a K11/K48-bispecific antibody, we did not observe notable 
amounts of K11/K48-branched chains on RvO222 (Extended Data 
Fig. 3c, e), suggesting that RVO222 is mainly ubiquitinated with K11- 
linked polyubiquitin. 
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(Protein Data Bank code 5KJP or 6LDZ; structure generated with PyMOL, 
version 2.0). f, Immunoblot and immunoprecipitation of lysates of HEK293T 
cells transfected with plasmids encoding HA-tagged K11-Ub, Myc-tagged 
ANAPC2 and Flag-tagged RvO222 or Flag-tagged RvO222(K76A). 

g, Immunoblot and immunoprecipitation of lysates of control (scramble) or 
ANAPC2-knockdown HEK293T cells transfected with plasmids encoding HA- 
tagged K11-Ub and Myc-tagged Rv0222 or Myc-tagged RvO222(K76A). Data in 
a-d, f, gare representative of one experiment with at least three independent 
biological replicates. For gel source data, see Supplementary Fig. 1. 


Ubiquitination of RvO222 inhibits immunity 


To determine the location of the lysine residue on RvO222 responsible 
for the attachment of K11-linked polyubiquitin, we examined the struc- 
ture of RvO222 (Protein Data Bank code 5KJP or 6LDZ) and identified 
nine lysine residues (Fig. 3e). Substitution of RVO0222 K76 with alanine 
nearly abrogated K11-linked ubiquitination, suggesting that K76 is 
essential for such ubiquitination (Fig. 3f). Furthermore, overexpres- 
sion of ANAPC2 markedly enhanced the K11-linked ubiquitination of 
Rv0222 but not of RVO222(K76A) in HEK293T cells (Fig. 3f). In addition, 
shRNA-mediated silencing of ANAPC2 markedly inhibited K11-linked 
polyubiquitination of RvO222 in HEK293T cells (Fig. 3g). Thus, host 
ANAPC2 may enhance the K11-linked polyubiquitin of M. tuberculosis 
Rv0222 at K76. 

We next investigated the role of K11-linked RvO222 polyubiquit- 
ination in inhibition of TRAF6 signalling. In HEK293T cells, TRAF6- 
induced activation of the NF-KB and AP-1 reporter genes was 
markedly inhibited by RvO222, but not by mutant RvO222(K76A) 
(Extended Data Fig. 4a, b). Moreover, RV0222(K76A) lost its inhibi- 
tory effect on K63-linked polyubiquitination of TRAF6 in HEK293T 
cells (Extended Data Fig. 4c). Levels of proinflammatory cytokine 
mRNAs were much lower in primary peritoneal macrophages infected 
with the H37Rv(ARvO222 + RvO222) strain than in those infected 
with H37Rv(ARvO222 + GFP), but were much higher in macrophages 
infected with H37Rv(ARvO222 + RvO222(K76A)) than in those infected 
with H37Rv(ARv0O222 + RvO222) (Fig. 4a—c). The cytokine levels were 
not completely restored when the mutant strain was complemented 
withthe K76A mutant, suggesting that RVO222 may regulate the expres- 
sion of //1b and //6 through an additional K76-independent mechanism. 
Furthermore, the expression of proinflammatory cytokines in lung 
tissue from both H37RvARvO222-infected C57BL/6 and severe com- 
bined immunodeficient (SCID) mice’’ was substantially higher than 
in H37Rv-infected mice, confirming the inhibitory effect of RV0222 on 
cytokine expression in vivo. However, infection with H37RVARVO222 
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Fig. 4| K11-linked ubiquitination of RVO222 by ANAPC2 suppresses anti- 
tuberculosis immunity. a—c, qPCR analysis of //7b (a), //6 (b) and /112 p40 (c) 
mRNA from peritoneal macrophages infected with H37Rv, H37Rv(ARVO222 + 
GFP), H37Rv(ARVO222 + RvO222) or H37Rv(ARVO222 + RVO222(K76A)) for 

the indicated times (MOI=5) (mean +s.e.m.). Data are representative of 

one experiment with at least three independent biological replicates; 

each circle represents one technical repeat. d-i, Six-week-old female 
C57BL/6 mice were aerosol-infected with roughly 200 CFUs per mouse 

of H37Rv, H37Rv(ARvO222 + GFP), H37Rv(ARVO222 + RvO222) or 
H37Rv(ARvO222 + RvO222(K76A)). d-f, qPCR analysis of /l1b (d), //6 (e) and /112 
p40 (f) mRNA in lungs from mice infected for 0, 1,7 and 14 days. Cumulative 
data from three independent experiments (mean +s.e.m. ofn=6 mice 
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untreated, n=9 mice infected for 1 day and n=18 mice infected for 7 and 14 
days). g-i, For lungs of mice infected for 30 days (g, i) or land 30 days (h), we 
assessed: histopathology in sections stained with haematoxylin and eosin 

(g; scale bars, 1,000 pm (top) and 200 pm (bottom)); bacterial titres 

(h; cumulative data from three independent experiments, mean+s.e.m. of 
n=6miceinfected for 1day and n=18 mice infected for 30 days); and acid-fast 
staining (i; scale bars, 100 pm (top) and 20 pm (bottom)). The outlined areasin 
the top images are enlarged in the bottom images. Two-tailed unpaired 
Student's t-test (a-f) and two-sided Mann-Whitney U-test (h) were used for 
statistical analysis. g,i, Data are representative of one experiment with at least 
three independent biological replicates. 


complemented with Rv0222, but not with RvO222(K76A), induced 
much lower expression of proinflammatory cytokines in the lungs of 
M. tuberculosis-infected mice (Fig. 4d—f and Extended Data Fig. 4d, 
e). Together, these results suggest that K76 is essential for RvV0222 to 
inhibit host proinflammatory cytokines. 

We also found that RvO222(K76A) did not bind as efficiently as wild- 
type RvO222 to either TRAF6 or SHP1 (Extended Data Fig. 4f, g). Con- 
sistent with this, overexpression of RvO222 rather than RVO222(K76A) 
in HEK293T cells markedly enhanced the association between SHP1 
and TRAF6 (Extended Data Fig. 4h). Thus, the increased RvO222- 
induced binding of SHP1 to TRAF6 may require K11-linked polyubiq- 
uitination on K76 of RvO222. Although further functional studies are 
required to elucidate the mechanism by which K11-linked polyubiq- 
uitination of RvO222 leads to an enhanced interaction between SHP1 
and TRAF6 and hence tothe inhibitory effect of RVO222, one possible 
explanation is that the polyubiquitin chains are required directly 
to form a RvO222-SHP1-TRAF6 complex. Alternatively, K11-linked 
polyubiquitination might cause a conformational change in RvO222 
that renders it more accessible for the binding of TRAF6 and SHPI1, 
enhancing their interaction and inhibiting the K63-linked polyubiq- 
uitin of TRAF6. 

Given that ANAPC2 promotes the K11-linked polyubiquitination 
of RvO222 at K76, we next investigated whether ANAPC2 is involved 
in suppressing TRAF6 signalling by RvO222. In ANAPC2-knockdown 
HEK293T cells, TRAF6-induced NF-kB luciferase reporter activity 
was not substantially inhibited by RvO222 (Extended Data Fig. 4i). 
Moreover, silencing of ANAPC2 with specific shRNA markedly reduced 
the binding of Rv0222 with TRAF6 in HEK293T cells (Extended Data 
Fig. 4j). ANAPC2 knockdown also impaired the inhibitory effect of 
Rv0222 on K63-linked polyubiquitination of TRAF6 in HEK293T 
cells (Extended Data Fig. 4k). Furthermore, in ANAPC2-knockdown 
HEK293T cells, RVO222 was considerably less able to enhance the asso- 
ciation of SHP1 with TRAF6 (Extended Data Fig. 41). Finally, we found 
that infection with H37RVARVO222 induced much higher proinflam- 
matory cytokine expression than did infection with H37Rv in THP1 
cells, but ANAPC2-knockdown THPI cells infected with H37RVARVO222 
showed an even lower level of proinflammatory cytokine expression 
than did those cells infected with H37Rv (Extended Data Fig. 4m-o). 
Silencing of ANAPC2 by specific shRNA led to a marked reduction in 
cytokine levels in both H37Rv- and H37RVARvVO222-infected cells, sug- 
gesting that ANAPC2 may also regulate the expression of cytokines 
in a RvO222-independent manner. We also observed that knock- 
down of ANAPC2 by specific shRNA in iBMDMs almost eliminated 
the H37Rv-induced activation of MAP kinases and NF-kB (Extended 
Data Fig. 4p). ANAPC2 is an important host E3 ligase, but it remains 
to be seen whether ANAPC2 is involved in activating the inflamma- 
tory signalling pathway through other mechanisms. Together, our 
results suggest that the host ubiquitin ligase ANAPC2 regulates the 
inhibitory effect of RVO222 on cytokine expression by promoting its 
K11-linked polyubiquitination. 


Virulence of RvO222 requires ubiquitination 


To investigate the pathological relevance of the K11-linked poly- 
ubiquitination of RvO222 in M. tuberculosis infection, we challenged 
C57BL/6 and SCID® mice with wild-type H37Rv, H37Rv(ARvO222 + GFP), 
H37Rv(ARVO222 + RvO222) or H37RV(ARVO222 + RvO222(K76A)), 
and examined histopathology and bacterial burden in the lungs. For 
C57BL/6 mice, infection with H37RVARvO222 complemented with 
wild-type RvO222 (rather than with the K76A mutant) led to more- 
severe histological damage and more immune-cell infiltration into 
the lungs (Fig. 4g). For SCID mice, at eight weeks after infection, 
almost all mice infected with H37Rv or H37Rv(ARvO222 + RvO222) 
died, but those infected with H37Rv(ARvO222 + GFP) or H37Rv 
(ARVO222 + RvO222(K76A)) lived for much longer (Extended Data 


Fig. 5a). Similarly, the growth defects of H37Rv(ARVO222 + GFP) inthe 
lung tissue of infected C57BL/6 mice or SCID mice were rescued when 
the strain was complemented with wild-type RvO222 but not with the 
K76A mutant (Fig. 4h, iand Extended Data Fig. 5b), suggesting that the 
virulence of H37Rv bacteria requires the K11-linked ubiquitination of 
Rv0222 at K76. 

Our findings identify RvO222 as a previously unrecognized com- 
ponent of the immunological signature of M. tuberculosis. Previous 
studies have shown the accumulation of ubiquitin on host membra- 
nous structures containing M. tuberculosis” **, and the interaction of 
a bacterial ubiquitin-binding protein with a host signalling molecule’; 
however, mycobacteria possess only a pupylation system for pro- 
tein post-translational modification”, rather than an endogenous 
ubiquitination system. Our results reveal a mechanism for subverting 
host immunity that depends on the ubiquitination of M. tuberculo- 
sis RvO222 by a host E3 ligase, ANAPC2, highlighting the versatility 
of host-M. tuberculosis interactions (Extended Data Fig. 6). Future 
research will expand to the role of ubiquitination mediated by other 
host E3 ligases’*”>. Notably, the K11-linked ubiquitination of RvO222 by 
ANAPC2 seems to havea different cellular effect to the APC/C-triggered 
degradation of many cell-cycle regulators”: the RvO222 ubiquitina- 
tion promotes the binding of phosphatases to immune signalling 
molecules, rather than substrate recognition by the proteasome". 
Our findings could pave the way to the development of effective anti- 
tuberculosis treatments that target the RVO222—-ANAPC2 interface. 
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Methods 


Bacterial strains and cells 
Bacterial strains are described in Supplementary Table 1. Mycobacterial 
strains were grown in Middlebrook 7H9 broth supplemented with 10% 
oleic acid—albumin-dextrose-catalase (OADC) and 0.05% Tween-80 
(Sigma) or Middlebrook 7H10 agar (BD) supplemented with 10% OADC. 
When required, the antibiotic kanamycin or hygromycin was added ata 
concentration of 100 pg mI for mycobacterial strain selection. Myco- 
bacteria cultures were grown up to mid-log phase (an optical density 
at 600 nm of approximately 0.6). Aliquots of bacterial cultures were 
prepared in 20% glycerol and Middlebrook 7H9 medium and preserved 
at -80 °C. The CFU per ml of the stocks was titred by plating serial dilu- 
tions on Middlebrook 7H10 agar plates plus 10% OADC. These stocks 
were used for all subsequent infections of macrophages and mice. 
HEK293T cells (ATCC CRL-3216) and iBMDMs (provided by F. Shao, 
National Institute of Biological Sciences, Beijing) were maintained in 
Dulbecco’s modified Eagle’s medium (DMEM; HyClone) supplemented 
with 10% (v/v) heat-inactivated fetal bovine serum (FBS, Gibco) and 
100 U mI" penicillin and streptomycin. Macrophages and THP1 cells 
(ATCC TIB-202) were cultured in RPMI-1640 medium supplemented 
with 10% (v/v) FBS. Peritoneal macrophages were obtained from mice 
three days after injection of thioglycollate (BD). We routinely tested 
all cells for mycoplasma contamination, and used only those cells that 
tested negative for further study. 


Plasmids, reagents and antibodies 

Plasmids are described in Supplementary Table 1. The following anti- 
bodies were used for western blot or immunoprecipitation: from 
Sigma-Aldrich, monoclonal mouse anti-Flag M2 affinity gel (cata- 
logue number/clone: A2220/M2), rabbit anti-HA antibody (H6908/ 
polyclonal), rabbit anti-GAPDH antibody (SAB2701826/polyclonal), 
rabbit anti-Flag antibody (F7425/polyclonal), rabbit anti-Myc antibody 
(C3956/polyclona) and rabbit anti-K11 antibody (MABS107-1/2A3/2E6); 
from Cell Signaling Technology, rabbit anti-APC2 antibody (12301/ 
polyclonal), rabbit anti-phosphorylated (p)-p65 antibody (3033/93H1), 
rabbit anti-p-p38 antibody (9215/3D7), rabbit anti-p-Jnk antibody 
(9251/polyclonal), rabbit anti-K63-polyubiquitin antibody (5621/poly- 
clonal), anti-rabbit IgG horseradish peroxidase (HRP)-linked antibody 
(7074) and anti-mouse IgG HRP-linked antibody (7076); from Santa 
Cruz Biotechnology, rabbit anti-TRAF6 antibody (sc-7221/polyclonal). 
Antibody against K11/K48-branched ubiquitin chains was provided 
by M. Rape (Howard Hughes Medical Institute) and M. Matsumoto 
(Genentech)". The rabbit polyclonal antibody against Rv0222 was 
generated by immunization of rabbits with purified histidine-tagged 
Rv0222 fusion protein, in collaboration with ABclonal Biotech. For 
western blot, all primary antibodies were diluted 1/1,000 and sec- 
ondary antibodies were diluted 1/2,000. For immunofluorescence, 
monoclonal mouse anti-Flag M2 antibody was used at 1/500 dilution 
and rabbit anti-TRAF6 at 1/100 dilution. Corresponding Alexa Fluor 
633- or Alexa Fluor 555-labelled anti-rabbit or anti-mouse antibodies 
(Invitrogen) were used as secondary antibodies (1/1,000 dilution). 
Fluorescein isothiocyanate (FITC) for strain dyeing was from Sangon 
Biotech (catalogue number 3326-32-7). 


Construction of Mycobacteria strains 

We used the pYUB854 system” to generate the H37Rv strain with dele- 
tion of the gene encoding Rv0222 (H37RVARVO222). In brief, upstream 
flanking sequence (UFS) and downstream flanking sequence (DFS) 
either side of the RvO222 gene were amplified from H37Rv genomic 
DNA. Following purification, the PCR products of UFS and DFS were 
ligated into the pGEM-Tamp vector (Promega), and the resulting plas- 
mid was transformed into competent £. coli DHSa cells, which were 
plated on LB agar plus 100 pg ml‘ ampicillin. Following overnight 
incubation of isolated white colonies in LB broth plus ampicillin, 


PCR analysis was used as above to identify successful ligations. The 
cloned UFS or DFS fragment was then subcloned into pYUB854 
digested with the appropriate restriction enzyme. The temperature- 
sensitive phage pHA159 and the pYUB854 vector, containing both 
UFS and DFS fragments for a particular RVO222 gene disruption, 
were digested with Pacl; following gel extraction, these fragments 
were ligated using T4 DNA ligase to create a shuttle plasmid, which 
was then in vitro packaged using the Lambda DNA Packaging Sys- 
tem (Promega) and transformed into E£. coli HB101 cells. Successful 
plasmids were identified by Pacl digestion, followed by antibiotic 
selection on LB plus hygromycin (100 pg mI") plates. The plasmid 
was then transduced into M. smegmatis (mc*155) at the permissive 
temperature of 30 °C, which allows replication and lysis, and a high- 
titre lysate was produced (more than 10” mI’). Transduction into 
H37Rv was then carried out with the high-titre lysate at an MOI of 10 at 
the nonpermissive temperature of 37 °C, which results in the delivery 
of the substrate. Cells-only and phage-only controls were included. 
Following incubation for two to three weeks on 7H10 plus OADC, 
glycerol and hygromycin, colonies were picked and grown in liquid 
medium (7H9 plus OADC, glycerol, Tween-80 and hygromycin) for 
two weeks; then, gene deletion was confirmed as above via PCR using 
primers outside the upstream and downstream flanking regions, with 
ashiftin size indicating a mutant. Immunoblot analysis was also used 
to confirm the deletion of RvO222 in the H37Rv strain. The shuttle 
vector pMV26]1 (provided by K. Mi, Institute of Microbiology, Bei- 
jing, China) was used to complement the strain H37RVARVO222 with 
wild-type RvO222 (with a carboxy-terminal Flag-tag) or to create the 
strain H37Rv RvO222(K76A) (with a C-terminal Flag-tag). Expression 
of RvO222 or its mutants (with a C-terminal Flag-tag) in mycobacteria 
was examined by immunoblot analysis. 


Transfection and confocal microscopy 

HEK293T cells were transiently transfected using Lipofectamine 2000 
(catalogue number 11668; Invitrogen). Confocal microscopy was per- 
formed as described”. 


Luciferase assay 

The dual-luciferase reporter assay system (Promega) was used for lucif- 
erase assays. HEK293T cells were transiently transfected with pNF-KB- 
luc, p-AP-1-luc, pRL-TK and indicated plasmids for 24 h. 


Reverse transcription-PCR analysis 
RNA preparation and qPCR analysis were performed as described using 
gene-specific primers (Supplementary Table 1). 


Immunoprecipitation and immunoblotting 

For immunoprecipitation, HEK293T cells were transiently trans- 
fected using Lipofectamine 2000. After 48 h, cells were washed with 
phosphate-buffered saline (PBS) and then lysed in cell-lysis buffer for 
western blotting and immunoprecipitation (Beyotime), supplemented 
with 1% protease-inhibitor cocktail (catalogue number P8340, Sigma). 
After 30 min onice, the lysates were centrifuged at 13,523g for 10 min at 
4°Ctoremove debris. The cell lysates were incubated with anti-Flag M2 
affinity gel overnight at 4 °C. For endogenous immunoprecipitation, 
primary peritoneal macrophages were stimulated with mycobacteria 
for the indicated time periods. The cells were lysed, and the lysates 
were incubated with anti-TRAF6 antibody and protein A/G (sephrasose) 
overnight at 4 °C. The sepharose samples were centrifuged, washed 
five times with cell-lysis buffer and boiled with SDS loading buffer for 
10 min. After separation by SDS-PAGE, equivalent amounts of protein 
were electroblotted onto nitrocellulose membranes. The membrane 
was blocked, incubated with primary antibodies, and washed three 
times before incubation with secondary antibody. After a final wash, 
analysis was conducted using enhanced chemiluminescence reagent 
(Thermo Scientific). 
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Mice and infection 

All animal experiments were reviewed and approved by the Animal 
Experiment Administration Committee of Tongji University School of 
Medicine and were conducted in accordance with the National Insti- 
tutes of Health (NIH) Guidelines for the Care and Use of Laboratory 
Animals. Female SPF C57BL/6 and SCID mice were purchased from 
Slaccas. Six- to eight-week-old female C57BL/6 mice were used for 
macrophage separation. Six-week-old C57BL/6 and SCID female mice 
were divided randomly into cages and infected by an aerosol method 
with approximately 100-200 CFUs of different H37Rv strains for four 
weeks (using an inhalation exposure system from Glas-col)”*” at the 
Biosafety Level-3 (BSL-3) Laboratory. Mice were killed at 1, 7, 14 or 30 
days after infection. Lungs were collected and homogenized in1 ml of 
PBS. Mycobacterial burden was determined by plating tenfold serial 
dilutions of each tissue homogenate on Middlebrook 7H10 agar plates. 
Colonies were counted after three to four weeks of incubation at 37 °C 
in 5% CO,. For histology, lungs were removed, fixed in 4% paraformal- 
dehyde in PBS and embedded in paraffin. Sections were cut and stained 
with haematoxylin and eosin (H&E) or Ziehl-Neelsen stain (acid-fast 
bacillus) according to standard protocols. Imaging was performed by 
microscopy (TCS CP5Il, Leica). All mice were age- and sex-matched in 
each experiment. Sample size was based on empirical data from pilot 
experiments. For in vitro M. tuberculosis infections, macrophages were 
infected with a single-cell suspension of bacteria at an MOI of 5. 


shRNA-mediated gene interference 

Short hairpin RNA inthe pLKO.1 vector was introduced into target cells 
via lentiviral transduction. pLKO.1-TRAF6-1 and pLKO.1-TRAF6-2 were 
gifts from G. Meng”. The targeting sequences were: human ANAPC2 
shRNA, 5’-GCAGAT TAAAGCAAGTCAGAT-3’; mouse Anapc2 shRNA, 
5’-GCGACGTTCTTCAGACATCAT-3’; mouse Tak1 shRNA, 5’-GCGCC 
TGAAGTATTTGAAGGT-3’; mouse Shp shRNA, 5’-GCTAGACTGTG 
ACATTGATAT-3’; mouse Shp2 shRNA, 5’-CGTGT TAGGAACGTCAA 
AGAA-3’; mouse Trafl shRNA, 5’-GCATCCTTTGATGGTACTTTC-3’. 
HEK293T cells were transfected with pSPAX2, pMD2.G and PLKO.1con- 
taining shRNAs targeting different genes. Lentiviruses were collected 
48 hlater and were used to infect iBMDMs (for ANAPC2, TRAF6, TRAFI, 
TAK1, SHP1 and SHP2 knockdown) or HEK293T cells and THPI cells 
(for ANAPC2 knockdown). Subsequently, selection with puromycin 
(2 ug mI) was carried out. 


Ubiquitination assay 

HA-tagged ubiquitin plasmids were gifts from X. Cao”. HEK293T cells 
were transiently transfected with the indicated plasmids for 48 h. 
Rv0222 or TRAF6 was immunoprecipitated with anti-Flag M2 affinity gel 
or protein A/G plus anti-TRAF6 antibody, and the obtained complexes 
were washed three times with urea (8 M, in PBS) to separate proteins 
bound by noncovalent bonds. Ubiquitinated RvO222 or ubiquitinated 
TRAF6 was detected by immunoblot analysis using HA-specific anti- 
bodies or specific ubiquitin antibody. For in vitro experiments, His- 
Rv0222 was purified from F. coliBL21 and APC/C was purified from 
G1 HEK293T cells by anti-CDC27 antibody (Santa Cruz) and protein 
A/G (Santa Cruz) at 4 °C. The beads were washed three times with SB 
(25 mM HEPES pH 7.5, 1.5 mM MgCl, 5 mM KCI, 1 mM dithiothreitol 
(DTT), 1x protease inhibitor, 15 mM creatine phosphate, 2mM ATP) plus 
0.05% Tween-20, and twice with SB before being resuspended in 40 ml 
SB. Beads were incubated with 100 nM E1 (Boston Biochem), 125 nM 
UbcH10 (Boston Biochem), 1.5 mg mI ubiquitin, 20 mM ATP, 15 mM 
creatine phosphate, creatine phosphokinase, 1mM DTT in UBAB buffer 
(25 mM Tris/HCl pH 7.5, 50 mM NaCl, 10 mM MgCl, ). The reaction was 


started when RvO222 was added, incubated at 30 °C for 1h, stopped 
when loading buffer was added and analysed by western blot with anti- 
ubiquitin antibody. 


Statistical analysis 

Statistical significance between groups was determined by two-tailed 
Student’s t-test, two-tailed analysis of variance followed by Bonferroni 
post hoc test or two-sided Mann-Whitney U-test. Differences were 
significant at P< 0.05. The experiments were not randomized, andthe 
investigators were not blinded to allocation during experiments and 
outcome assessment. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


RNA-sequencing data have been deposited in Gene Expression 
Omnibus (GEO) under accession codes GSM4.005162, GSM4005163, 
GSM4005164, GSM4005165, GSM4005166 and GSM4005167. Other 
data that support the findings of this study are available within 
the Source Data and Supplementary Information. Source Data for 
Figs. 1,2, 4 and Extended Data Figs. 1, 2, 4,5 are provided with the paper. 
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Extended Data Fig. 1| RV0222 inhibits host inflammatory responses. 

a, b, Luciferase assay of HEK293T cells transfected with the NF-KB (a) or AP-1(b) 
reporter gene with RvO222 plasmid (+) or vector plasmid (-).c, Immunoblot of 
lysates from iBMDMs transfected with a plasmid encoding Flag-tagged RvO0222 
for 48 h, following infection with H37Rv for the indicated times (MOI=5). 
p-p65, phosphorylated p65, an indicator of NF-KB activation; GAPDH, 
glyceraldehyde-3-phosphate dehydrogenase, aloading control. d, Immunoblot 
of RvO222 protein in H37Rv, ARVO222, H37Rv(ARVO222 + RvO222) and 
H37Rv(ARVO222 + RvO222(K76A)) cells with anti-RVO222 antibody. Coomassie 
blue staining was used as acontrol. e-g, qPCR analysis of /l1b (e), //6 (f) and /112 
p40(g) mRNA from peritoneal macrophages infected with H37Rv or ARvO222 


for the indicated times (MOI=5).h,i, Lactate dehydrogenase assay for cell death 
(h) and CFU assay (i) in mice peritoneal macrophages infected with H37Rv or 
ARv0222 for 3h (i), 12h or 24h (MOI=5).j, CS7BL/6 mice were aerosol-infected 
with roughly 200 CFUs per mouse of H37Rv, H37Rv(ARVO222 + GFP) or 
H37Rv(ARVO222 + RvO222) for 30 days. Histopathology was assessed in lung 
sections stained with acid-fast (scale bars, 100 xm (top) and 20 pm (bottom)). 
Two-tailed unpaired Student’s f-test (a, b, e-i) was used for statistical analysis. 
Data are representative of one experiment with at least three independent 
biological replicatesinc, d,j;and show mean+s.e.m. ofn=3 cultures ina, b, e-i. 
For gel source data, see Supplementary Fig. 1. 
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Extended Data Fig. 2|See next page for caption. 


Extended Data Fig. 2| RV0222 interacts with and inhibits TRAF6 signalling. 
a, Immunoblot of lysates from peritoneal macrophages infected with H37Rv or 
ARvO222 for the indicated times (MOI=5). b,c, qPCR analysis of //1b (b) and //6 
(c) mRNA from peritoneal macrophages treated with the p38 inhibitor 
SB203580, JNK inhibitor SP600125 or NF-kB inhibitor PDTC for 1h following 
H37Rv or ARvO222 infection for 8 h (MOI=5).d, Immunoblots and 
immunoprecipitates of lysates from HEK293T cells transfected with the 
indicated plasmids. e-h, Luciferase assay of the effect of RVO222 onthe 
activation of TRAF6/TAK1in HEK293T cells transfected with a plasmid 
encoding NF-KB (e, g) or AP-1(f, h) luciferase reporter or indicated plasmids for 
24h.i-k, qPCR analysis of //1b (i, right), //6 (j) and /112 p40 (k) mRNA from 
control or TRAF6-knockdowniBMDMs infected with H37Rv or ARvVO222 for 4h 
(MOI=5).i, Left, immunoblots of TRAF6 in lysates from control (‘Scrambled’) 
or TRAF6-knockdown (TRAF6-1, TRAF6-2) iBMDMs. I-n, qPCR analysis of //1b 
(1, right), //6 (m) and //12p40 (n) mRNA from control or TAK1-knockdown 


iBMDMs infected with H37Rv or ARvO222 for 6h (MOI=5). 1, Left, immunoblots 
of TAK1in lysates from control or TAK1-knockdowniBMDMs. 0, Immunoblots of 
lysates from control or TRAF1-knockdowniBMDMs infected with H37Rv for the 
indicated times (MOI=5). p-r, qPCR analysis of /l1b (p), 116 (q) and /112 p40 (r) 
mRNA from control or TRAF1-knockdowniBMDMs infected with H37Rv for the 
indicated times (MOI=5).s,t, Immunoblots andimmunoprecipitation of 
lysates from HEK293T cells transfected witha plasmid encoding HA-tagged Ub 
(s) or K63-Ub (t), Flag-tagged TRAF6 and increasing amounts of Myc-tagged 
Rv0222 (Opg, lpg and 2 1g). u, Immunoblots of lysates from control or SHP1- or 
SHP2-knockdowniBMDMs. v-y, qPCR analysis of //1b and /I6 mRNA from 
control or SHP1/2-knockdown iBMDMs infected with H37Rv or ARvO222 for 6h 
(MOI=S). Two-tailed unpaired Student’s t-tests (b, c, e-n, v-y) were used for 
statistical analysis. Data are representative of one experiment with at least 
three independent biological replicates (a, d,o,s-u) and are mean+s.e.min 
b,c, e-n, p-r, v-y. For gel source data, see Supplementary Fig. 1. 
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Extended Data Fig. 3 | RV0222 is K11-ubiquitinated at K76 by host ANAPC2. 
a,b, Immunoblots andimmunoprecipitates of HEK293T cell lysates 
transfected with a plasmid encoding Flag-tagged RvO222 and HA-tagged 
wild-type ubiquitin (Ub) or HA-tagged Ub witha single lysine residue left 
unmutated among K6, K11, K27, K29, K33, K48 and K63.c, Immunoblots and 
immunoprecipitates of lysates from peritoneal macrophages infected with 
H37Rv(ARVO222 + RvO222) for the indicated times (MOI=5). d, Immunoblots 


IB:K11/K48-Ub —IB:HA 


andimmunoprecipitates of HEK293T cell lysates transfected witha plasmid 
encoding Flag-tagged RvO222 and Myc-tagged ANAPC2, RNF7 and BIRC2. 

e, Immunoblots and immunoprecipitates of HEK293T cell lysates transfected 
with the indicated plasmids. Data are representative of one experiments with 
at least three independent biological replicates. For gel source data, see 
Supplementary Fig. 1. 


a b c d 
P=0.0121 
P=0.0439 1 
6,000 P=0.52 10 1 FLAG-TRAF6 - + + + P=00013 — P=0.0075 
S | Myc-Rv0222 
3 oO o S ¥ fo) 6 
& e 8 ° HA-K63-Ub s ° 
= 4,000 = 2 
s S 6 < 
S g IP:FLAG Zz 
8 2,000 3 4 IB:HA K63-UbE 
Fs 5 = 
a < 
0. 0 IP:FLAG 
Rvo222K8 + - + Rvo222K"3 + + FLAC ae AE CK hh 
3 N 
TRAF6 - + + TRAF6 - + + Lysate eee <2 ~ eo 
IB:Myc & 
e P=00144 f g h ARv0222+ 
FLAG-TRAF6 + - + + + 
85 P=00018 P=00419 
[1 1 FLAG-RvO222 + - + K76 FLAG-Rv0222 +K76 - + K76 HA-SHP1 - + + + + 
a ° CO -ATRAFO - + + + HA-SHP1 - - + + + Myc-Rv0222 - - - + K76 
s fe) rizuu ‘ 
3 IP:FLAG + IP:FLAG s IP:FLAG 
= on ; Breve IBHA\ “Ss! IB:HA eee re <srP1 
zs 
x IP:FLAG 
E IP:FLAG : f = —* <TRAFG 
2 iBFLAG 2 MN + RV0222 PAC ame ee + Rv0222 (5 FAG a a 
= =a Lysate a ee er 
Lysate | | FB Lysate IB:HA SHP1 
iH Sees pra ee ee < sHP 1 Lysate 
IB:FLAG ’ WR «rare 
Lysate Pa Lysate Lb 
IBFLAG | MR RV0222 Ce ee < .0222 aye SS ewe <20222 
ARv0222+ 
: % x i shRNA Scrambled ANAPC2 
T- Mm Rv0222 —__ 
i O™ ye shRNA __ Scrambled ANAPC2 ss giag-tRarg t- + ++ - ++ 
MMB TRAFG «= ME. TRAF6+RV0222 eX OO 
shRNA 2 FLAG-TRAF6 + - ++ +4 4+ - +4 +4 HASupy <b eb oes 
" Pr036,  FLAG-TRAF6 + -+ + - + HA-Ub - - + +KS3K83- - + + K83K83 et eeece ee eg 
- Q HARVOZ22 -++-++ Myc-Rv0222- + - ++ -- +-++4 Mye-Rvd2e2 
me} \° 
g 40 IP:FLAG IP:FLAG er 
5 BHA | Rv0222 IB:HA 
g si P<00001 Baia G 
> - IP:FLAG IB:HA t IP:FLA 
ES I J < : 
5 20 IB:FLAG hel TRAF6 ierLac @_ S88 Sa 
a 
¥ 10 Lysate IP:-FLAG 
va : : one Lysate 
2 IBHA SR MMP +RVO222 | Ee Gena re 
shRNA Scrambled ANAPC2 
m MB H37RvV n o 
[-] ARvo222 ere 
ee P<00001 Po 8 “3 600 P<0.0001 
mo P=00006 ca, rm 
3 15 s "9 P=0.0007 3 
> 34 P=0.0033 < 400 
Z P<0.0001 
z 10: Z & 4 
E 2 2 S 
25 p=00002 Pt e gS -o 
— | | >= of 
0 0. 0 
Oh 4h 8h Oh 4h~ 8h Oh 4h 8h Oh 4h 8h Oh 4h 8h Oh 4h 8h 
shRNA — Scrambled ANAPC2 shRNA Scrambled ANAPC2 shRNA Scrambled ANAPC2 
Pp H37Rv 
shRNA Scrambled ANAPC2 
Time(min) 0 5 15 306090 0 515 30 6090 
~~ — 
P= Sse + p-JNK 
—- ‘ i* p-p38 


- em < p-p65 
ee eee ee me GAPDH 


Extended Data Fig. 4 | See next page for caption. 
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Extended Data Fig. 4 | K11-linked ubiquitination of RV0222 by ANAPC2 
suppresses TRAF6 signalling. a, b, Luciferase assay of HEK293T cells 
transfected with a plasmid encoding NF-KB (a) or AP-1(b) luciferase reporter or 
the indicated plasmids for 24 h.c, Immunoblots and immunoprecipitates of 
HEK293T cell lysates transfected with the indicated plasmids. Six-week-old 
female SCID mice were aerosol-infected with roughly 100 CFUs per mouse 

of H37Rv, H37Rv(ARvVO222 + GFP), H37Rv(ARvO222 + RvO222) or 
H37Rv(ARVO222 + RvO222(K76A)). d, e, qPCR analysis of //1b (d) and //6 (e) 
mRNA in lung tissues from mice infected for 7 days. f-h, Immunoblots and 
immunoprecipitates of HEK293T cell lysates transfected with indicated 
plasmids. i, Luciferase assay of control or ANAPC2-knockdown HEK293T cells 
transfected with a plasmid encoding NF-kB luciferase reporter or the indicated 


plasmids for 24 h.j-I, Immunoblots and immunoprecipitates of lysates from 
control or ANAPC2-knockdown HEK293T cells transfected with the indicated 
plasmids. m-o, qPCR analysis of //1b (m), /[6 (n) and //12 p40 (0) mRNA from 
control or ANAPC2-knockdown THP!I cells infected with H37Rv or ARvO222 for 
the indicated times (MOI=5). p, Immunoblot of lysates from control or 
ANAPC2-knockdowniBMDMs infected with H37Rv for the indicated times 
(MOI=S). Two-tailed unpaired Student’s t-test (a, b, d, e, i, m-o) were used for 
statistical analysis. Data are representative of one experiment with at least 
three independent biological replicates (c, f-h,j-I, p) and are mean+s.e.m. 
ina,b,i,m-o.d,e, Cumulative data from two independent experiments 

(n=6 mice). For gel source data, see Supplementary Fig. 1. 
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Extended Data Fig. 5| K11-linked ubiquitination of RVO222 by ANAPC2 
suppresses anti- tuberculosis immunity. a, Survival of six-week-old 
female SCID mice aerosol-infected with roughly 100 CFUs per mouse of 
H37Rv, H37Rv(ARVO222 + GFP), H37Rv(ARvO222 + RvO222) or 
H37Rv(ARVO222 + RvO222(K76A)). b, Bacterial titres in lungs from mice 
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infected for 1 day or 14 days. Gchan-Breslow-Wilcoxon test (a) and two-sided 
Mann-Whitney U-test (b) were used for statistical analysis. Cumulative data 
from two independent experiments (n =37 miceina;n=4 mice infected for 
1day andn=8 mice infected for 14 days inb). 
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Extended Data Fig. 6 |Summary diagram. The K11-linked ubiquitination mRNAs (/l1b and /112 p40), part of innate immunity, by facilitating the 
(K11ub) of M.tuberculosis-secreted protein Rv0222 by host ANAPC2suppresses recruitment of SHP1/2 to polyubiquitinated (K63ub) TRAF6. Mtb, 
the TLR2/TRAF6/NF-KB-mediated expression of proinflammatory cytokine M. tuberculosis. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


x| The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


x A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 


x 
| Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


[x]|[__] A description of all covariates tested 


x A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 
x] A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
a AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 
[x] For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
a Give P values as exact values whenever suitable. 
x For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 
x For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 
x Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection The qRT-PCR data: ViiA 7 Software v1.2.4 on ViiA 7DX. 
The western blot data: ImageQuant LAS 4000mini and Amersham Imager 600. 
The confocal data: Leica Microsystems on Leica TCS SP8 confocal laser microscopy system. 


Data analysis GraphPad Prism software 5.0 was used for all analyses. 
Image J software 1.51 was used for calculating co-localization ratio of confocal data. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information 


Data 


Policy information about availability of data 


All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


RNA-seq data have been deposited in the Gene Expression Omnibus under accession code GSM4005162, GSM4005163, GSM4005164, GSM4005165, GSM4005166 
and GSM4005167. Other data that support the findings of this study are available within the source data and Supplementary Information. 
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Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 
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For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size No statistical methods were used to predetermine sample sizes. Sample size was based on empirical data from pilot experiments. Reported in 
Materials and Methods section (Mice and infection). 


Data exclusions No exclusion of data points or images were used. 


Replication All attempts at replication were successful as indicated in the figure legends. All animal experiments were repeated and the data pooled for 
statistical analyses. 


Randomization — For mice infection experiment, Six -week-old C57BL/6 and SCID female mice were divided randomly into cages and infected with 
approximately 100-200 CFUs of different H37Rv strains. H37Rv strains were allocated into groups according to genotype of interest. 


Blinding The investigators were blinded during data collection and analysis where possible. This included Q-PCR data collection in cell and mice tissue 
cytokines assay. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
[x Antibodies x ChIP-seq 
[x Eukaryotic cell lines x Flow cytometry 
x Palaeontology x MRI-based neuroimaging 


[x Animals and other organisms 


x Human research participants 


x]|[_] Clinical data 


Antibodies 


Antibodies used The following antibodies were used for western blot or immunoprecipitation: Monoclonal mouse anti-FLAG M2 affinity gel 
(Catalog#/Clone: A2220/M2), rabbit anti-HA antibody (Catalog#/Clone:H6908/polyclonal), rabbit anti-GAPDH antibody (Catalog#/ 
Clone:SAB2701826/polyclonal), rabbit anti-FLAG antibody (Catalog#/Clone:F7425/polyclonal), and rabbit anti-K11 antibody 
(Catalog#/Clone: MABS107-1/2A3/2E6) antibodies were purchased from Sigma-Aldrich, St. Louis, MO. Rabbit anti-APC2 antibody 
(Catalog#/Clone: 12301/polyclonal), rabbit anti-phospho p65 antibody (Catalog#/Clone:3033/93H1), rabbit anti-phospho p38 
antibody (Catalog#/Clone:9215/3D7), rabbit anti-phospho Jnk antibody (Catalog#/Clone:9251/polyclonal) antibody, anti-rabbit 
IgG HRP-linked antibody (Catalog#/Clone:7074) and anti-mouse IgG HRP-linked antibody(Catalog#/Clone:7076) were from Cell 
Signaling Technology, Danvers, MA; rabbit anti-TRAF6 (Catalog#/Clone:sc-7221/polyclonal) and monoclonal mouse anti-Myc 
(Catalog#/Clone:sc-40/9E10) antibodies were from Santa Cruz Biotechnology, USA. The K11/K48-branched ubiquitin chains 
antibody was provided by Prof. Michael Rape (Howard Hughes Medical Institute) and Dr. Marissa Matsumoto (Genentech Inc.). 
The rabbit polyclonal antibody to Rv0222 was generated by immunization of rabbits with the purified His-RvO222 fusion protein, 
in collaboration with ABclonal Biotech, Shanghai, China. For western blot, all the primary antibodies were 1/1000 diluted and 
secondary antibodies were 1/2000 diluted. 


For immunofluorescence assay, monoclonal mouse anti-FLAG M2 antibody (Catalog#/Clone: F1804/M2, 1/200 dilution) were 
purchased from Sigma-Aldrich, St. Louis, MO and rabbit anti-TRAF6 (Catalog#/Clone:sc-7221/polyclonal, 1/100 dilution) was from 
Santa Cruz Biotechnology, USA. Corresponding Alexa Fluor 633- or Alexa Fluor 555-labeled anti-rabbit or anti-mouse antibodies 
(Invitrogen) were used as secondary antibody (1/1000 dilution). FITC, Fluorescein isothiocyanate (Catalog# 3326-32-7) was from 
Sangon Biotech, Shanghai, China for strains dyeing. 


Validation The Monoclonal mouse anti-FLAG M2 affinity gel, rabbit anti-HA antibody, rabbit anti-GAPDH antibody, rabbit anti-FLAG 
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antibody, rabbit anti-phospho p65 antibody, rabbit anti-phospho p38 antibody, rabbit anti-phospho Jnk antibody, rabbit anti- 
TRAF6 antibody rabbit anti-Myc antibody and rabbit anti-K63-linkage specific polyubiquitin antibody was validated for western 
blots of mouse samples or HEK293T cells overexpression system by many our previous publications (PMID: 26928339; 23001144; 
31110366). The rabbit anti-K11 antibody was validated for western blots of K11 recombinant protein on the manufacturer's 
website(https://www.sigmaaldrich.com/catalog/product/mm/mabs107i?lang=zh&region=CN). Rabbit anti-APC2 antibody was 
validated for western blots of human samples by previous publication (PMID: 29987118). The K11/K48-branched ubiquitin chains 
antibody was validated for western blots of Hela cell lysate by previous publication (PMID: 29033132). The rabbit polyclonal 
antibody to RvO222 was validated in this study. RvO0222 was detected in H37Rv (wild-type Mtb) with western blot using anti- 
Rv0222 antibody, while none was detected in RvO222 bacteria (knockout RvO222 in H37Rv)(Extended data Figure 1b). 


For immunofluorescence assay, monoclonal mouse anti-FLAG M2 antibody was validated for IF in U2OS cells expressing 
exogenous FLAG-tagged protein by previous publication (PMID: ZBTB48 is both a vertebrate telomere-binding protein anda 
transcriptional activator). The rabbit anti-TRAF6 antibody was validated for IF in mice BMDCs by previous publication 
(PMID:21460221). 
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Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) HEK293T cells (ATCC CRL-3216) and THP1 (ATCC TIB-202) were obtained from the American Type Culture Collection (ATCC). 

Authentication Cell lines purchased from commercial vendors, ATCC have been authenticated by the commercial vendor using short tandem 
repeat (STR) analysis. Generated knockout or knockdown cells were validated by specific antibodies as shown in extended 
figure. 

Mycoplasma contamination All the cells were routinely tested for contamination by mycoplasma and tested negative cells were used in the study. 


Commonly misidentified lines No commonly misidentified cell lines were used in this study. 
(See ICLAC register) 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals All animal experiments were reviewed and approved by the Animal Experiment Administration Committee of Tongji University 
School of Medicine and were conducted in accordance with the National Institutes of Health (NIH) Guidelines for the Care and 
Use of Laboratory Animals. Female SPF C57BL/6 and SCID mice were purchased from Slaccas (Shanghai). Six- to eight-week-old 
female C57BL/6 mice were used for macrophage separation. Six -week-old C57BL/6 and SCID female mice were divided 
randomly into cages and infected by an aerosol method with approximately 100-200 CFUs of different H37Rv strains for 4 weeks 
(using a Glas-Col inhalation exposure system (Glas-col, Terre Haute, IN)) at the Biosafety Level-3 (BSL-3) Laboratory. 


Wild animals The study did not involve wild animals. 
Field-collected samples The study did not involve samples collected from the field. 
Ethics oversight All animal experiments were reviewed and approved by the Animal Experiment Administration Committee of Tongji University 


School of Medicine and were conducted in accordance with the National Institutes of Health (NIH) Guidelines for the Care and 
Use of Laboratory Animals. Reported in Materials and Methods section (Mice and infection). 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Immune surveillance against pathogens and tumours in the central nervous system is 
thought to be limited owing to the lack of lymphatic drainage. However, the 
characterization of the meningeal lymphatic network has shed light on previously 
unappreciated ways that an immune response can be elicited to antigens that are 
expressed in the brain’ °. Despite progress in our understanding of the development 
and structure of the meningeal lymphatic system, the contribution of this network in 
evoking a protective antigen-specific immune response in the brain remains unclear. 
Here, using amouse model of glioblastoma, we show that the meningeal lymphatic 
vasculature can be manipulated to mount better immune responses against brain 
tumours. The immunity that is mediated by CD8T cells to the glioblastoma antigen is 
very limited when the tumour is confined to the central nervous system, resulting in 
uncontrolled tumour growth. However, ectopic expression of vascular endothelial 
growth factor C (VEGF-C) promotes enhanced priming of CD8 T cells in the draining 
deep cervical lymph nodes, migration of CD8 T cells into the tumour, rapid clearance 
of the glioblastoma and a long-lasting antitumour memory response. Furthermore, 
transfection of an mRNA construct that expresses VEGF-C works synergistically with 
checkpoint blockade therapy to eradicate existing glioblastoma. These results reveal 


the capacity of VEGF-C to promote immune surveillance of tumours, and suggest a 
new therapeutic approach to treat brain tumours. 


We used the C57BL/6 syngeneic cell line GL261 to model glioblastoma 
in mice. Mice injected orthotopically with luciferase-expressing GL261 
cells (GL261-Luc) showed cell-number-dependent growth kinetics and 
lethality (Extended Data Fig. la-c), demonstrating that intracranial 
injection of GL261-Luc is not sufficient to promote tumour rejectionin 
the central nervous system (CNS). To evaluate the effects of enhanced 
lymphangiogenesis, we used two modes of delivery to express the 
lymphangiogenesis-promoting factor VEGF-C in mice: an adeno-associ- 
ated viral vector (AAV9) andan mRNA delivery vector. Consistent with 
previous reports'*®, injection of AAV-VEGF-C into the cerebrospinal 
fluid (CSF) of mice through the cisterna magna remodelled meningeal 
lymphatic vessels in the dural confluence of sinuses (Fig. 1a, b) andin 
the sagittal sinuses (Extended Data Fig. 1h, i), and did not affect the 
integrity of the blood-brain barrier* (Extended Data Fig. 1d, e). Inmice 
that were prophylactically treated with AAV-VEGF-C, we observed a 
near-complete rejection of tumours (Fig. 1c, Extended Data Fig. 1j). 


VEGF-C-mediated protection requires T cells 

Previous studies have shown that deep cervical lymph nodes are the 
primary draining lymph nodes of the CNS, and that mandibular and 
superficial cervical lymph nodes contribute to antigen sampling 


in the CNS'*°”. Thus, we surgically ligated the afferent lymphatic 
vessels that drain to both deep cervical lymph nodes in mice treated 
with AAV-VEGF-C. Although AAV-VEGF-C-treated mice showed 
prolonged survival compared to control mice (Fig. 1d), after ligation of 
the deep cervical lymph nodes the majority of the AAV-VEGF-C-treated 
mice succumbed to the tumour. This indicates that VEGF-C-mediated 
protection against glioblastoma requires lymph drainage to the deep 
cervical lymph nodes. 

The requirement for the deep cervical lymph nodes suggests that the 
immune system has a role in VEGF-C-mediated protection. Depletion 
of CD4 or CD8 T cells negated the protection that was conferred by 
VEGF-C (Fig. le, Extended Data Fig. 1k). By contrast, B-cell-deficient 
(uMT) mice that were treated with VEGF-C were protected from glioblas- 
toma (Extended Data Fig. 11). We examined the durability of this immune 
response against glioblastoma in mice that were treated with VEGF-C, 
and found that mice that rejected the intracranial tumour showed long- 
term systemic memory responses; rechallenge with GL261-Luc in the 
flank resulted in no detectable tumours (Fig. 1f, Extended Data Fig. 1m). 
Together, these data demonstrate that by increasing lymphangiogen- 
esis in the meninges, prophylactic treatment with VEGF-C can evoke 
arobust and long-lasting T-cell-dependent immune response against 
brain neoplasms. 
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Fig. 1| VEGF-C-mediated protection against glioblastoma depends onthe 
draining lymph nodes and onT cells. a, b, C57BL/6 mice were injected with 
AAV-CTRL or AAV-VEGF-C intracisternally through the cisterna magna. Six to 
eight weeks later, mice were euthanized and the dura was collected to image 
the lymphatic vasculature (LY VE1") inthe confluence of sinuses (AAV-CTRL, 
n=7; AAV-VEGF-C,n=8).***P=0.0007.c, C57BL/6 mice that had been injected 
with AAV-CTRL or AAV-VEGF-C two months previously were implanted with 
5,000 GL261-Luc cells in the striatum and monitored for survival (naive, n=3; 
AAV-CTRL,n=4; AAV-VEGF-C, n= 8). ***P=0.0004 for naive versus AAV- 
VEGF-C; P= 0.33 (NS, not significant) for naive versus AAV-CTRL. d, Six-to-eight 
weeks after injection of AAV vectors, the deep cervical lymph nodes of mice 
were ligated using a cauterizer. Seven days later, mice were challenged with 
50,000 GL261-Luc cells in the striatum and monitored for survival (AAV-CTRL, 
n=4; AAV-CTRL with lymph node (LN) ligation, n=4; AAV-VEGF-C, n=4; 
AAV-VEGF-C with lymph node ligation, n =10).**P=0.007 for AAV-CTRL 


Glioblastoma is devoid of lymphangiogenic signals 
Previous studies have shown that tumour-intrinsic overexpression of 
VEGF-C in mice or humans results in poorer prognosis for malignancies 
that occur in areas outside of the CNS®”. We compared the transcriptomes 
of brain tissue from healthy control individuals and patients with glioblas- 
toma using data fromthe Genotype-Tissue Expression project (GTEX) and 
The Cancer Genome Atlas (TCGA), respectively. Tumour tissue showed 
higher expression of VEGFA and CD31 (also known as PECAMI) than control 
tissue, as previously shown”, but a decrease in the expression of VEGFC 
(Fig. 2a—c, Extended Data Fig. 2a, b)—which was also seen in mice that 
bear glioblastoma tumours (Extended Data Fig. 2c). When we stratified 
the patients with glioblastoma into two groups on the basis of their VEGFC 
expression levels (high versus low), we observed no differences in survival 
(Extended Data Fig. 2d, e), as the expression levels of VEGFC in patients 
with glioblastoma were lower than those in healthy brain tissue even in 
the group with high VEGFC expression. However, inarecently published 
dataset” of patients with glioblastoma who were treated with neoadjuvant 
anti-PD-1, the expression of VEGFCwas highly correlated with an increase 
in the infiltration of T cells (as measured by the expression of the T cell 
marker genes CD3F, CD4 and CD8B) after treatment (Fig. 2d-f). Collec- 
tively, these results suggest that the glioblastoma microenvironment is 
deprived of lymphangiogenic signals bothin patients with glioblastoma 
andin experimental mice, and that ectopic expression of VEGF-C confers 
protection against glioblastoma in a mouse model. 


VEGF-C potentiates immune checkpoint blockade 


Although treating mice with AAV-VEGF-C led to survival benefits with 
no long-term side effects (Extended Data Fig. Id, e, g), this vector 
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versus AAV-VEGF-C;***P< 0.0001 for AAV-CTRL versus AAV-VEGF-C LN ligation; 
P=0.15 (NS) for AAV-CTRL versus AAV-CTRLLN ligation. e, Similar toc, but 
mice that had previously been injected with AAV-VEGF-C were treated with 
anti-CD4 or anti-CD8 antibodies (AAV-CTRL, n=6; AAV-VEGF-C,n=5; 
AAV-VEGF-C + anti-CD8, n=5; AAV-VEGF-C + anti-CD4, n=5) **P=0.0014 for 
AAV-CTRL versus AAV-VEGFC; P= 0.34 (NS) for AAV-CTRL versus AAV-VEGF-C 
with anti-CD8;**P= 0.0014 for AAV-CTRL versus AAV-VEGF-C with anti-CD4. 

f, Mice injected with AAV-CTRL or AAV-VEGF-C that survived over 100 days 
after challenge with 5,000 GL261-Luc cells were rechallenged with 500,000 
GL261-Luc cells in the flank. Tumours were measured after flank rechallenge at 
days 7 and 15 (n=3 per group, all 3 mice from the GL261100 d group had no 
measurable tumours). Data are pooled from two independent experiments 
(b-f) and are mean +s.d. Pvalues were calculated by two-tailed unpaired 
Student’s f-test or two-sided log-rank Mantel-Cox test. 


is—owing to the immunogenicity of AAV’?—less effective when given 
subsequently to the same host. By contrast, an mRNA delivery vector 
does not elicit an immune response against the vector’. Therefore, we 
designed an mRNA construct to express VEGFC, with stabilizing base 
substitutions (Extended Data Fig. 3a, b). Transfection of HEK293T cells 
with the VEGFC mRNA construct produced both the full-length and the 
processed form of VEGF-C" (Extended Data Fig. 3c). Administration of 
the VEGFC mRNA construct into the cisterna magna of mice resulted 
in high levels of VEGF-C in the CSF (Extended Data Fig. 3d). We used 
Cy5-labelled mRNA to confirm the wide distribution of VEGFC mRNA 
within the dura mater (Extended Data Fig. 3f), and saw its uptake by 
immune cells (CD45*), endothelial cells (CD45°CD31°) and other cells 
(CD45 CD31) in the brain, meninges and deep cervical lymph nodes 
(Extended Data Fig. 3g, h). Furthermore, we observed an increase in 
the levels of secreted VEGF-C protein that was specific to the CSF and 
meninges, with no changes in the brain or serum (Extended Data Fig. 3i). 
Using AKT phosphorylation to measure vascular endothelial growth 
factor receptor (VEGFR) signalling in endothelial cells after treatment 
with VEGF-C, we observed an increase in VEGFR signalling that was 
specific to the lymphatic endothelial cell (LEC) population; by contrast, 
no changes were observed in the blood endothelial cells (BECs) inthe 
meninges or deep cervical lymph nodes (Extended Data Fig. 4a-c). 
This increase in VEGFR signalling was not accompanied by structural 
deformities in either the LECs or the BECs, even within an angiogenic 
tumour environment (Extended Data Fig. 4b-e). These data indicate 
that whereas VEGFC mRNA is taken up by various cells of the brain, 
meninges and deep cervical lymph nodes, VEGF-C protein is mostly 
confined to the CSF and meninges, and activates LECs. 

Inaddition to the difference inimmunogenicity, the expression kinet- 
ics of the VEGFCmRNA construct and the AAV-VEGF-C vector differ; the 
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Fig. 2| Human glioblastomais deprived of lymphangiogenic signals at 
steady state, and VEGF-C levels correlate with T cell infiltration with 
anti-PD-1 therapy. RNA sequencing (RNA-seq) data of tumour tissue (TCGA, 
study accession phs000178.v10.p8) and healthy brain cortex (GTEX, study 
accession phs000424.v7.p2). a—c, Expression profiles of VEGFA (b), CD31 
(angiogenic) (a) and VEGFC (lymphangiogenic) (c) genes in samples of healthy 
cortex versus glioblastoma (cortex, n=133 samples; glioblastoma, n=147 
samples). d-f, Correlation (from RNA-seq) of changes in the expression of 
VEGFCand of the T cell markers CD3E (d), CD4 (e) and CDSB (f) after PD-1 
therapy (data from GSE121810; n= 24). Dataare mean+s.d. Pvalues were 
calculated by two-tailed unpaired Student’s t-test or Pearson’s correlation. 


VEGFCmRNA construct shows peak expression after 24 h, whereas the 
expression of AAV-VEGF-C does not reach maximal levels for several 
weeks». We evaluated the therapeutic efficacy of both modalities by 
administering mice with each vector on different days, and found that 
only prophylactic (day 60) treatment with AAV-VEGF-C resulted in 
long-term survival (Extended Data Fig. 4f-h). By contrast, when mono- 
therapy with the VEGFC mRNA construct was administered at days 0, 
3 or 7, survival benefits were still observed—although none of these 
treated mice survived in the face of rapid glioblastoma development 
(Extended Data Fig. 4f). 

Similarly, anti-PD-1 and other checkpoint blockade therapies show 
little benefit against glioblastoma when used as monotherapies in 
preclinical models”, and have not shown favourable responses in 
clinical trials'®. However, strategies that promote T cell responses— 
such as activation of dendritic cells”, or radiation’*—potentiate anti- 
PD-1 therapy against glioblastoma, suggesting that the hurdles in the 
treatment of cancers of the CNS may be during T cell priming. Hypoth- 
esizing that VEGF-C increases immune surveillance of the brain, we 
reasoned that therapy using VEGF-C would have synergistic antitumour 
effects with checkpoint inhibitor therapy. Whereas anti-PD-1 alone had 
marginal effects on survival, a combination of treatment with VEGFC 
mRNA and anti-PD-1 antibody resulted in regression of tumours and 
survival benefits in mice bearing established GL261 tumours (Fig. 3a, 
b), ina T-cell-dependent manner (Extended Data Fig. 5d). When mice 
that survived in the long term were rechallenged with tumours in the 
contralateral hemisphere, they exhibited resistance to the secondary 
challenge (Extended Data Fig. 5b), and T cells transferred from the 
draining lymph nodes and spleen of mice that rejected tumours con- 
ferred protection against a primary challenge (Extended Data Fig. 5c). 
Asurvival benefit was also observed in mice bearing tumours derived 
froma more proliferative and invasive syngeneic glioblastoma cell 
line (CT-2A) after treatment with a combination of VEGFC mRNA and 
anti-PD-1 together with the T-cell-activating agent anti-4-1BB”° 
(Extended Data Fig. Seg). In addition, other checkpoint inhibitors that 
are known to be ineffective’*"””! were potentiated with VEGFC mRNA 
(Extended Data Fig. 5h-j). Even when mice were treated after the tumour 
mass had increased substantially (day 20), the combination therapy 
showed survival benefits (Extended Data Fig. 5k, I). 


To assess whether T cell priming against glioblastoma in the deep cer- 
vical lymph nodes was increased after mice were treated with VEGF-C, 
we used an endogenous tumour antigen that is present in many mouse 
cancers”. Endogenous retroviruses are integrated remnants of retro- 
virus in the host genome that are silenced epigenetically, but which 
often become aberrantly expressed in the dysregulated transcrip- 
tional states that are found in cancers”. We identified endogenous 
retrovirus sequences based on the gene Emv2that were overexpressed 
in GL261 cells (Extended Data Fig. 6a, b). We therefore constructed 
tetramers against emv2-env (an envelope protein coded by Emv2)” 
and found that after administration of GL261 cells in the flank of mice, 
tetramer-positive CD8 T cells were enriched in the draining inguinal 
lymph nodes (Extended Data Fig. 6d)—demonstrating that endogenous 
tumour-specific T cell priming occurs in an antigen-dependent manner 
in response to GL261. 

Intracranial inoculation of mice with GL261-Luc resulted in small 
tetramer-positive populations of T cells in the deep cervical lymph 
nodes (1.54%) (Fig. 3c, d). However, treatment with the VEGFC mRNA 
construct resulted in significant increases inthe tumour-specific T cell 
population in the deep cervical lymph nodes (3.65%) (Fig. 3c, d). In addi- 
tion, increases in both the percentage (Fig. 3d) and the absolute num- 
bers (Extended Data Fig. 5a) of tumour-specific T cells were detected in 
the brain after treatment with the VEGFC mRNA construct. These data 
reveal that intracranial tumours elicit minimal CD8 T cell responses in 
the deep cervical lymph nodes, but that VEGFC mRNA enhances the 
priming of CD8 T cells against tumours in the brain. 

We then examined the specificity of VEGF-C in potentiating check- 
point inhibitor therapy. First, to confirm the requirement of menin- 
geal lymphatic vessels, a soluble version of VEGFR-3 (VEGFR-3,_3-lg) 
was administered, using an AAV construct, to sequester VEGF-C; this 
resulted in atrophy of lymphatic vasculature in the dura*™ (Extended 
Data Fig. 7a, b). Treatment with VEGFR-3,_,-lg abrogated the efficacy 
of VEGFC mRNA and anti-PD-1 therapy (Extended Data Fig. 7c). VEGF-C 
also uniquely provided therapeutic benefits in combination with 
anti-PD-1 when compared to other recombinant proteins of the VEGF 
family (VEGF-A, VEGF-B, VEGF-Cs and VEGF-D; Cs is Cys156Ser, a VEGFR- 
3-selective agonist) (Extended Data Fig. 7d-f). These experiments 
demonstrate the specificity of the antitumour capacity of VEGF-C and 
the requirement for the meningeal lymphatic vessels in providing thera- 
peutic benefits against glioblastoma. 

Tumour-intrinsic VEGF-C has also previously been reported to medi- 
ate increased metastases in melanoma and breast cancer’, To assess 
whether this was a possibility in the CNS, cells expressing blue fluo- 
rescence protein (BFP) were tracked to determine the distribution of 
tumour cells in vivo. We collected the brain and deep cervical lymph 
nodes to measure tumour cells (CD45 BFP"), as well as immune cells 
that may have phagocytosed tumour cells (CD45*BFP*). No CD45 BFP* 
cells were found inthe deep cervical lymph nodes after treatment with 
VEGF-C, suggesting that VEGF-C does not promote metastasis of glio- 
blastoma to the lymph nodes (Extended Data Fig. 7g-k). In addition, 
direct effects of VEGF-C ontumour cells are unlikely, as glioblastoma 
cells showed no expression of VEGFR-3, and VEGF-C had no effect on 
cell proliferation in culture (Extended Data Fig. 71, m). Of note, both 
VEGF-C delivery vectors (AAV and mRNA) resulted in an increase inthe 
number of CD45‘BFP’ cells in the deep cervical lymph nodes (Extended 
Data Fig. 7i)—consistent with reports that antigen drainage is increased 
by exogenous VEGF-C!?. 

To examine how VEGF-C modifies the immune landscape of tumours 
before anti-PD-1 therapy, we performed flow cytometry in the brain, 
meninges and deep cervical lymph nodes of mice after treatment with 
the VEGFCmRNA construct. Myeloid populations of cells showed mini- 
mal changes, with no differences in their levels of activation (CD80) or 
antigen-presentation capabilities (major histocompatibility complex 
(MHC) II) (Extended Data Fig. 8a—g). The largest changes induced by 
treatment with the VEGFC mRNA construct were in the number and 
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Fig. 3 | Therapeutic delivery of VEGF-C potentiates checkpoint inhibitor 
therapy by enhancing T cell priming and recruitment. a, Schematic of 
treatment plans. b, Mice inoculated with 50,000 GL261-Luc cells were treated 
with VEGFC mRNA or GFP mRNA constructs (day 7) and with either anti-PD-1 
(RMP1-14) antibodies or isotype control antibodies (days 7, 9 and 11), and 
monitored for survival (top; Kaplan-Meier curve; GFP mRNA + isotype, n=11; 
VEGFC mRNA + isotype, n=9; GFP MRNA + anti-PD-1, n= 8; VEGFCMRNA + 
anti-PD-1, n=10) and tumour burden (bottom; GFP mRNA + isotype, n=7; 
VEGFC mRNA + isotype,n=5; GFP mRNA + anti-PD-1, n=4; VEGFC mRNA + anti- 
PD-1,n=5). Dataare pooled from 2 independent experiments. ****P< 0.0001. 
RLU, relative luminescence units. c, d, Mice were inoculated with 50,000 
GL261-Luc cells and treated with GFP mRNA or VEGFC mRNAat day 7. Seven 
days after mRNA treatment, the deep cervical lymph nodes (dCLN) and 
tumour-bearing brain hemispheres were collected to detect tetramer-positive 


the phenotypes of T cells that infiltrate the brain (Fig. 3e, Extended 
Data Fig. 9a). This result, combined with the increase in CD45*BFP* 
cells inthe deep cervical lymph nodes (Extended Data Fig. 7i), suggests 
that the increased priming and subsequent infiltration of T cellsisa 
result of increased antigen drainage. Mice that were treated with the 
VEGFC mRNA construct also showed a stable number of TCF1* CD8 
Tcells—a population that is important for eliciting antitumour effector 
responses”©”” (Extended Data Fig. 9c-f)—in the three compartments 
(brain, meninges and deep cervical lymph nodes), along with increased 
FOXP3* CD4 T cells (Extended Data Fig. 9f). Notably, a higher propor- 
tion of FOXP3* cells expressed T-bet, whichis a critical regulator of the 
differentiation of T helper 1 cells (Extended Data Fig. 9f) and is associ- 
ated with increased levels of IFNy in regulatory (CD4*CD25") T cells 
and reduced suppression of CD4 effector cells”’. These data indicate 
that there is a shift towards an antitumour immune response in mice 
that are treated with VEGF-C. Consistent with this, CD8 T cells in the 
brain of VEGF-C treated mice were also capable of producing multiple 
cytokines after ex vivo stimulation (Extended Data Fig. 9e-i)—a positive 
prognostic factor for immunotherapy”. All of these changes seemed to 
occur independently ofa direct effect on T cells, as no VEGFR-3 expres- 
sion was detectable in CD4 and CD8T cells (Extended Data Fig. 7n), and 
VEGF-C did not affect the proliferation of T cells in vitro (Extended Data 
Fig. 7p). We did not observe expression of VEGFR-3 in other immune 
compartments, and stimulation of bone-marrow-derived dendritic 
cells with VEGF-C induced no change in the expression levels of MHC 
llor costimulatory molecules (Extended Data Fig. 70). Together, these 
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CD8T cells. c, Concatenated fluorescence-activated cell sorting (FACS) plots of 
CD45*CD3*CD8*CD44' T cells in tumour-bearing brain and deep cervical lymph 
nodes after treatment with GFP mRNA or VEGFC mRNA. d, Quantification of 
deep cervical lymph nodes (left; circle, ipsilateral; square, contralateral 

(GFP mRNA, n=6; VEGFCmRNA, n=12) or tumour-infiltrating tetramer-positive 
CD8T cells (right) (GFP MRNA, n=3; VEGFCmMRNA, n=5). The experiment was 
repeated independently with similar results. ***P= 0.007 (lymph nodes); 

**P= (0.002 (tumour). e, Mice were inoculated with 50,000 GL261-Luc cells and 
treated with Luc mRNA or VEGFCmRNAat day 7. Tumour-inoculated brain 
hemisphere tissue was collected and analysed using FACS, and the number of 
CD3-positive cells was assessed (n= 3; 3 mice were pooled for each sample). 
Data are mean ¢+s.d. Pvalues were calculated by two-tailed unpaired Student’s 
t-test or two-sided log-rank Mantel-Cox test. 


results show that VEGF-C provides a potent antitumour environment, 
which—in association with increased lymphatic drainage and antigen 
presentation—promotes multifunctional and durable T cell immunity 
against glioblastoma. 


VEGF-C restores CNS-restricted T cell priming 


In contrast to what is seen in patients with glioblastoma, reports have 
shown that combined therapy with nivolumab and ipilimumab had 
intracranial efficacy, concordant with extracranial activity, in patients 
with melanoma that had metastasized to the brain*®. Consistent with 
clinical observations, mice with both a flank and an intracranial tumour 
responded better to immunotherapy than did those with just an intrac- 
ranial melanoma”. To examine whether VEGFC mRNA is effective in 
treating other (non-glioblastoma) types of cancer inthe CNS, we used 
the melanoma cell lines YUMMER1.7 and B16 (Extended Data Fig. 10a). 
Mice bearing only intracranial YUMMERI.7 tumours showed signifi- 
cant survival benefits when treated with VEGFC mRNA and checkpoint 
inhibitor therapy (Fig. 4a). By contrast, mice with both intracranial and 
flank tumours benefited from checkpoint inhibitor therapy regard- 
less of VEGFC mRNA treatment (Fig. 4b). In fact, the survival benefits 
to mice with only intracranial tumours that received combination 
therapy were similar to mice with both intracranial and flank tumours 
that were treated with checkpoint inhibitor therapy alone (Fig. 4c). In 
addition, ligation of the deep cervical lymph nodes only affected mice 
with intracranial tumours, not mice with both intracranial and flank 
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Fig. 4 | Therapeutic delivery of VEGF-C mediates protection against 
intracranial melanoma when combined with checkpoint inhibitor blockade 
and is equivalent to peripheral priming. a—c, Mice were given either only 
YUMMER1.7 intracranial tumours (IC) (a) or botha YUMMER1.7 intracranial 
tumour anda YUMMER1.7 flank tumour (FT) (b), and treated with GFP mRNA or 
VEGFCmRNA on day7, and anti-PD-1 (RMP1-14) and anti-CTLA4 (9H10) on days 7, 
9 and 11. Two groups (ligation IC VEGFC mRNA + anti-PD-1+ anti-CTLA4 and 
ligation FT IC VEGFC mRNA + anti-PD-1+ anti-CTLA4) had lymph nodes ligated 

7 days before tumour implantation (a, n=12 for all groups except ligation IC 
VEGFCmRNA + anti-PD-1+ anti-CTLA4, n=7;b,n=12 for all groups except 
ligation FT IC VEGFCmRNA + anti-PD-1+ anti-CTLA4, n=7). Pvalues: a, P=0.13 
(NS) for IC VEGFC mRNA; ***P= 0.001 for IC GFP mRNA + anti-PD-1+ anti-CTLA4; 
****P < 0.0001 for IC VEGFC mRNA + anti-PD-1+ anti-CTLA4; **P= 0.069 for 
ligationIC VEGFCmRNA + anti-PD-1+ anti-CTLA4 (all versus IC GFP mRNA); 

b, *P=0.013 for FT IC VEGFC mRNA; **P= 0.0029 for FT IC GFP mRNA + anti- 


tumours (Fig. 4a, b). Similar results were observed with the B16 cell 
line (Extended Data Fig. 10c-e). These results are consistent with the 
idea that T cell priming—either through expression of VEGF-C in the 
CSF or througha flank tumour—enables efficient checkpoint inhibitor 
responses inthe CNS. However, in the case of atumour that is confined 
to the CNS at steady state, immune checkpoint inhibitors alone do not 
confer notable benefits, regardless of tumour type. 

Finally, we sought to understand whether the antitumour effects of 
VEGF-C are due to (i) the expansion and differentiation of T cells that 
are capable of trafficking to the brain (that is, T-cell-intrinsic); or (ii) 
changes inthe brain environment that cause an increase in the recruit- 
mentof T cells (that is, T-cell-extrinsic). We performed adoptive transfer 
of leukocytes from the draining lymph nodes of tumour-bearing mice 
into recipient mice that also had tumours (Fig. 4d, e). The infiltration 
of T cells into the brain was slightly increased (T-cell-intrinsic; 1.6-fold 
increase) ifthe T cells were from mice treated with VEGFC mRNA. How- 
ever, T cells from tumour-bearing control mice were able to migrate 
into glioblastoma-affected brains to a greater extent if the recipient 
was treated with the VEGFC mRNA construct (T-cell-extrinsic; 3.1-fold 
increase) (Fig. 4e). Moreover, donor T cells from mice that were 
treated with VEGFCmRNA, transferred into a host that was also treated 
with VEGFC mRNA, were able to infiltrate the tumour most efficiently 
(combinatorial; 4.2-fold increase). Increases in the numbers of T cells 
inthe lymph nodes were dependent onthe recipient mice having been 
treated with VEGFC mRNA—consistent with the notion that antigen- 
specific T cells proliferate only after increased drainage of tumour 
antigens from the CNS. These results show that VEGF-C provides an 
antitumour environment through a combination of its effect on the 
meningeal lymphatic vasculature and the changes that it induces in 
T-cell-intrinsic properties. 


Lymph node 


PD-1+ anti-CTLA4; ***P= 0.0003 for FT IC VEGFC mRNA + anti-PD-1+ anti- 
CTLA4; P= 0.051 (NS) for ligation FT IC VEGFC mRNA + anti-PD-1+ anti-CTLA4 
(all versus FT IC GFP mRNA); c, P= 0.72 (NS) for IC VEGFC mRNA + anti- 
PD-1+anti-CTLA4 versus FT IC VEGFCmRNA + anti-PD-1+ anti-CTLA4. 

d, Schematic of experimental design for the results shown ine. Congenic 
CD45.2 mice were injected with GL261 tumours. Seven days later, mice were 
treated with GFP mRNA or VEGFC mRNA. Seven days after the mRNA treatment 
(14 days after tumour inoculation) leukocytes from deep cervical lymph nodes 
were transferred into congenic CD45.1 mice bearing 7-day tumours. Five days 
after transfer, the deep cervical lymph nodes and brain tissues were collected 
toanalyse T cell infiltration. e, Quantification of brain-infiltrating T cells and 

T cells inlymph nodes; numbers denote the fold change in the number of cells 
over donor GFP and recipient GFP groups (n=5 mice per group). Dataare 
mean+s.e.m. For Pvaluesine, *P< 0.05; ****P< 0.0001. Pvalues were calculated 
by two-tailed unpaired Student’s t-test or two-sided log-rank Mantel-Cox test. 


Discussion 


We have demonstrated that ectopic expression of VEGF-C has the 
potential to manipulate meningeal lymphatics, and thereby enables 
immune surveillance and T-cell-mediated immunity against brain 
tumours (Extended Data Fig. 10g). These results support a growing 
body of work that highlights the importance of the meningeal lym- 
phatic network in controlling immune responses in the CNS? >. The lack 
of immune responses against tissue grafts” in the CNS has led to the 
idea that the brain has animmune-privileged status, and a similar phe- 
nomenon of immune privilege may allow primary brain tumours such 
as glioblastoma to grow unhindered in the CNS”. Together with these 
previous reports, our study suggests that tumours confined within 
the CNS may be inaccessible for immune priming, and that exogenous 
expression of VEGF-C potentiates endogenous immune responses. We 
considered the possibility that immune-cell- or tumour-cell-intrinsic 
VEGF-C signalling causes this phenomenon, but found that immune 
and tumour cells showed no changes after direct stimulation with 
VEGF-C. Instead, our data suggest that VEGF-C (and its relationship 
with its main receptor partner VEGFR-3) leads to an increase in lym- 
phatic drainage**, which is necessary for immunosurveillance against 
glioblastoma-—similar to the way in which melanoma-intrinsic overex- 
pression of VEGF-C enhances antitumour T cell responses®. It is worth 
noting that the immune-privileged status of the CNS does not apply to 
all brain antigens®, and future studies are needed to define the rules of 
CNS antigen sampling. In summary, our study shows that increasing 
lymphatic drainage can overcome the immune ignorance of glioblas- 
toma. This lays the foundation for anew strategy in which modulation of 
the lymphatic vasculature is used to increase the effects of checkpoint 
inhibitor therapy against tumours in immune-privileged sites. 
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Methods 


Mice 

Four- to-eight-week-old mixed sex C57BL/6 mice, B6.129S2-/gh"™"©"/ 
J (MT) mice and B6.SJL-Ptprc*Pepc>/BoyCrl mice were purchased from 
the National Cancer Institute, Jackson Laboratory and Charles River, 
respectively, and subsequently bred and housed at Yale University. 
All procedures used in this study (sex-matched, age-matched) complied 
with federal guidelines and the institutional policies of the Yale School 
of Medicine Animal Care and Use Committee. 


Cells 

GL261 parental cells were obtained from the NIH cancer cell repository. 
GL261-Luc cells were a gift fromJ. Zhou (Yale Neurosurgery) and C. 
Rothlin (Yale Immunobiology). YUMMER1.7 cell lines have previously 
beenreported®. CT-2A and CT-2A-BFP cells were a gift from T. Mathivet 
(Paris Centre de Recherche Cardiovasculaire). B16 cells were a gift from 
N. Palm (Yale Immunobiology). HEK293T cells were purchased from 
ATCC. HEK293T, CT-2A, CT-2A-BFP and B16 cells were cultured incom- 
plete DMEM (4.5¢ 17 glucose, 10% FBS, 1% penicillin/streptomycin). 
YUMMER1.7 cells were cultured in DMEM/F-12 medium (10% FBS, 1% 
non-essential amino acids, 1% penicillin/streptomycin). GL261 and 
GL261-Luc cells were cultured in RPMI (10% FBS, 1% penicillin/strep- 
tomycin). All cells tested negative for mycoplasma contamination. 


Viral vectors 

We used AAV9 encoding either human VEGF-C, mouse VEGF-C or solu- 
ble mouse VEGFR-3. AAV-VEGF-C (mouse) was generated using the 
psubCMV WPRE plasmid as previously reported for human VEGF-C*. 
Control AAV-VEGFR-3,_,-lg encodes domains 4-7 of mouse VEGFR-3 
(that do not bind VEGF-C or VEGF-D) fused to the mouse IgG1 Fc domain. 
VEGFR-3,_;-lg, which encodes domains 1-3 of mouse VEGFR-3, was used 
to sequester VEGF-C in vivo. Young mice (2-4 weeks old, AAV-VEGFR-3; 
4-6 weeks-old, AAV-VEGF-C) received an injection of a single dose of 
AAVs (3 x10” viral particles per mouse per 3 pl) into the cisterna magna. 
After 6-8 weeks, they were engrafted with intracerebral GL261 or CT-2A 
glioblastoma cells. 


Antibodies and tetramer 

Anti-CD45 (30-F11, APC-Cy7, B266564; 104, BUV737, 9051755; 104, 
PE-Cy7, B268066; A20, APC, B254.042; 30-F11, BV605, B278000), 
anti-CD3 (17A2, BV605, B264993; 145-2C11, APC-Cy7; 17A2, BUV737, 
9042537; 17A2, biotin, B259691), anti-CD4 (GK1.5, Pacific Blue, 
B199050; GK1.5, BUV496, 8080653), anti-CD8 (53-6.7, BV711, B259953; 
53-6.7, BUV395, 8306672), anti-IFNy (XMGI1.2, BV711, B236526), anti- 
GZMB (GB11, FITC, B275568), anti-TNF (MP6-XT22, PE-Cy7, B251190), 
anti-IL-2 (JES6-5H4, APC, B248053), anti-CD44 (IM7, A700, B244378), 
anti-TBET (4B10, BV711, B268785), anti-TIM3 (RMT3-23, BV605, 
B262042), anti-FOXP3 (FJK-16 s, BV421, B266620), anti-TCF7 (C63D9, 
A488, 8, Cell Signaling Technology), anti-PD-1 (29F.1A12, APC-Cy7, 
B260172), anti-RORyT (B2D, APC, E16663-102), anti-CD40 (PE, E028955), 
anti-CD80 (16-10A1, BUV395; 16-10A1, FITC, E029730), anti-CD86 (APC, 
B175381), anti-CX3CR1(SAO11F11, BV421, B231871), anti-Ly6C (AL21, 
FITC, 33380), anti-CD11c (N418, PE-Cy7, B264758; BV421, B264454), 
anti-CD11b (M1/70, PE, B228654), anti-Ly6G (IA8, APC-Cy7, B153128), 
anti-MHCII (MS5/114.15.2, A700, B264454), anti-CD64 (X54-5/7.1, APC, 
B254424), anti-B220 (RA3-6B2, BUV496, 8096734), anti-NK1.1 (PK136, 
biotin, B255213), anti-CD19 (6D5, biotin, B250292), anti-F4/80 (BM8, 
biotin, B253458), anti-podoplanin (eBio8.1.1, PE, E11344-399, eBiosci- 
ence), anti-CD31 (390, A647, 8187629) and anti-AKT (pS473) (M89-61, 
BV421, 7198801) antibodies other than anti-TCF7 and anti-podoplanin 
were purchased from BioLegend or BD Biosciences. Anti-VEGFR-3 and 
anti-FLT-4 (FAB743P, PE, ACBF0117091) antibodies were purchased 
from R&D Biosciences. The emv2-env tetramer (K°-restricted peptides 


aa 604-611 of p15E protein (KSPWFTTL)) was made through the NIH 
tetramer core facility. KSPWFTTL peptide was made by Biomatik Corpo- 
ration. The depletion antibodies anti-CD4 (GK1.5), anti-CD8 (YTS169.4), 
anti-PD-1(RMP1-14), anti-CTLA4 (9H10), anti-TIM3 (RMT3-23) and anti- 
4-1BB (LOB12.3) were purchased from Bio X Cell. 


VEGFC mRNA construct 

The VEGFC mRNA construct (Extended Data Fig. 2) was made by 
TriLink BioTechnologies (with full substitution of pseudouridine and 
5-methylcytosine bases), capped using CleanCap Reagent AG and poly- 
adenylated (120A). mRNA was mixed at a ratio of 1 pg per 0.1 pl of in vivo- 
JETPEI (Polyplus Transfection) and vortexed for 30 s and incubated in 
room temperature for 15 min before use. Control mRNA (Cy5-labelled 
GFP, GFPmRNA; luciferase, Luc mRNA) was also purchased from TriLink 
Biotechnologies. Recombinant proteins (VEGF-A, VEGF-B, VEGF-Cs and 
VEGF-D) were purchased from R&D Systems. 


Tissue processing and microscopy 

Meningeal lymphatic vessels were detected on whole-mount prepa- 
rations of the dura matter. Skull-cap samples were dissected, fixed 
in 2% PFA for one hour and immediately processed in a blocking 
solution (10% normal donkey serum, 1% bovine serum albumin, 0.3% 
PBS-Triton X-100) for overnight incubation at 4 °C. For detection of 
lymphatic vessels, samples were incubated with the primary antibody 
anti-LYVE-1 (AngioBio, 11-034, 1:400), overnight at 4 °C, then washed 
five times at room temperature in PBS and 0.5% Triton X -100, before 
incubation witha fluoro-conjugated secondary antibody (Alexa Fluor 
anti-rabbit 647, Thermo Fisher Scientific, 1:500) diluted in PBS and 
5% normal donkey serum. Meningeal lymphatic vessel images were 
acquired using a spinning-disk confocal (Nikon Eclipse Ti). Quantita- 
tive analysis of meningeal lymphatic coverage was performed using 
either FIJI or ImageJ image-processing software (NIH or Bethesda). 
LYVE-1‘ macrophages that are less intensely labelled than lymphatic ves- 
sels were eliminated by adjusting image contrast. Otsu’s thresholding 
method was then used to convert captured images into binary images. 
The fluorescence-labelled area covered by meningeal lymphatic ves- 
sels was measured in the confluence of sinuses and the sagittal sinus 
regions, and was normalized to the average fluorescence of the same 
regions of meningeal lymphatic vasculature in AAV-CTRL-treated mice. 
For brain sections, anti-CD3 (17A2, biotin), anti-CD31 (2H8, GeneTex) 
and anti-LYVE1 (AF2125, R&D Systems) antibodies were used with anti- 
streptavidin (FITC, BD Biosciences, 4031801), anti-hamster (127-165- 
160, Cy3, Jackson ImmunoResearch, 128827) and anti-goat (705-175-147, 
Cy5, Jackson ImmunoResearch, 138513) secondary antibodies, respec- 
tively. Confocal images were taken on a Leica SP8 microscope. Three- 
dimensional (3D) rendering was completed using Imaris 8 software 
(Oxford Instruments). 


Western blot 

HEK293T cells were transfected with the VEGFC mRNA construct com- 
bined with lipofectamine. Samples were lysed in RIPA buffer and boiled 
for 5 min with sample buffer. For in vivo delivery, VEGFC mRNA with 
JETPEI was used. The CSF was pooled from 10 mice and spun down 
to remove cells; the CSF was then filtered using a 100k Amicon fil- 
ter and the wash-through was boiled with sample buffer for western 
blotting. Western blotting was performed ina similar manner to that 
previously reported’. In brief, 15% gels were used and run at 10 mA 
per gel for 30 min and 40 mA per gel until separation of ladder. Wet 
transfer was performed at 120 mA per gel for 90 min on ice. Anti-VEGF-C 
antibody was used at aconcentration of 1:1,000 (R&D Systems, AF752) 
and incubated overnight in the cold room. After washing, HRP-conju- 
gated anti-goat secondary antibodies were used at a concentration of 
1:5,000 at room temperature for 2 h and imaged using the ChemiDoc 
MP imaging system (Bio-Rad). 
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Tumour inoculation 

Mice were anaesthetized using a mixture of ketamine (50 mg kg”) 
and xylazine (5 mg kg”), injected intraperitoneally. Mice heads were 
shaved and then placed in a stereotaxic frame. After sterilization of 
the scalp with alcohol and betadine, a midline scalp incision was made 
to expose the coronal and sagittal sutures, and a burr hole was drilled 
2 mmlateral to the sagittal suture and 0.5 mm posterior to the bregma. 
A10-pl Hamilton syringe loaded with tumour cells was inserted into 
the burr hole at a depth of 2.5 mm from the surface of the brain and 
left to equilibrate for 1 min before infusion. A micro-infusion pump 
(World Precision Instruments) was used to infuse 3 pl of tumour cells 
at1 pl min™. Once the infusion was finished, the syringe was left in place 
for another minute before removal of the syringe. Bone wax was used 
to fill the burr hole and the skin was stapled and cleaned. Following 
intramuscular administration of analgesic (meloxicam and buprenor- 
phine, 1 mg kg”), mice were placed ina heated cage until full recovery. 


Intracisterna-magna injection and collection of CSF 

For intracisterna-magna injections, mice were anaesthetized using 
ketamine and xylazine, and the dorsal neck was shaved and cleaned 
with alcohol. A 2-cm incision was made at the base of the skull, and the 
dorsal neck muscles were separated using forceps. After visualization 
of the cisterna magna, a Hamilton syringe with a 15-degree, 33-gauge 
needle was used to puncture the dura. Three microlitres of AAV9 (3 x10” 
viral particles per mouse) or MRNA vector (4-5 pg) was administered 
per mouse at arate of 1 pl min”. After completion of the injection, the 
needle was left in to prevent backflow for an additional 3 min. For CSF 
collection, a custom-pulled micropipette (0.75/1 1brl GF; Stoelting) 
was used to penetrate the dura and ensure no blood was collected. The 
skin was stapled and cleaned and the same postoperative procedures 
were performed as for the tumour inoculations. 


Ligation of deep cervical lymph nodes 

For ligation of lymph nodes, mice were anaesthetized using ketamine 
and xylazine, and the rostral neck was shaved and disinfected. A 2-cm 
incision was made and the salivary glands containing the superficial 
cervical lymph nodes were retracted and the deep cervical lymph 
nodes were visualized. The afferent lymph vessels were tied off 
with a 4-0 Vicryl suture, and then cauterized. The incision was closed 
with a 4-0 Vicryl suture and mice were subjected to the same postop- 
erative procedures as above. 


Flank tumour inoculation 

Mice were anaesthetized using ketamine and xylazine, and the flank 
was Shaved and disinfected. A 1-ml syringe with a 30G needle was 
used to deliver 100 pl of 500,000 cells subcutaneously. For GL261-Luc 
cells, cells were mixed in a 1:1 volume with Matrigel (Corning) before 
delivering. 


Adoptive transfer 

To evaluate memory against tumour using adoptive transfer, T cells 
from deep cervical lymph nodes and spleens of mice that rejected 
tumours after treatment with VEGFC mRNA and anti-PD-1 were isolated 
using the EasySep Mouse T cell Isolation Kit (StemCell Technologies) 
andtransferred into naive mice 24 h before tumour inoculation (all the 
T cells from one donor were transferred to one recipient). 

To study leukocyte trafficking after treatment with VEGF-C, CD45.2 
mice were inoculated with GL261 tumours. At day 7, mice were treated 
with GFP mRNA or VEGFC mRNA. Seven days after treatment, deep 
cervical lymph nodes were collected and filtered through 70-pm filter 
paper, and whole-leukocyte suspensions (30 million cells per mouse; 
approximately 3-5 deep cervical lymph nodes transferred into one 
mouse) were adoptively transferred in to CD45.1 mice bearing 7-day 
tumours. After transfer, mice were treated with GFP mRNA or VEGFC 


mRNA intracisternally. Five days later, deep cervical lymph nodes 
and brain tissue were collected to evaluate immune cell trafficking. 
Five minutes before euthanizing the mouse, 25 pg of anti-CD45 PE 
(30-F11, PE, Biolegend) antibodies were administered intravenously 
to stain circulating immune cells. 


IVIS imaging 

Mice were anaesthetized using isoflurane and injected intraperitoneally 
with RediJect D-Luciferin Ultra (PerkinElmer) (200 pI, 30 mg mI’). After 
10 min, mice were imaged using the IVIS Spectrum In Vivo Imaging 
System (PerkinElmer). 


RNA-seq 

RNA-seq data were aligned using STAR (STAR/2.5.3a-foss-2016b, mm10 
assembly) with parameters:-runThreadN 20-outSAMtype BAM Sort- 
edByCoordinate-limitBAMsortRAM 35129075129-outFilterMultimap- 
Nmax 1-outFilterMismatchNmax 999-outFilterMismatchNoverLmax 
0.02-alignIntronMin 20-alignIntronMax 1000000-alignMatesGap- 
Max 1000000 for mapping of repetitive elements. Counts were counted 
using BEDTools (BEDTools/2.27.1-foss-2016b) (coverageBed function), 
normalized using DESeq2 and graphed using the Morpheus web tool 
from the Broad Institute. Human RNA-seq data were obtained from 
TCGA and GTEX databases and analysed using the above parameters 
(hg38 assembly) (Extended Data Fig. 2a—c), and survival stratified by 
VEGF-C expression was analysed using OncoLnc (www.OncoLnc.org) 
(Extended Data Fig. 2e, f). 


Isolation of mononuclear cells and flow cytometry 

Tissue was collected and incubated in a digestion cocktail contain- 
ing 1 mg mI‘ collagenase D (Roche), 1 mg ml‘ collagenase A (Roche) 
and 30 pg ml DNase (Sigma-Aldrich) in complete RPMI (10% FBS) at 
37 °C for 30 min. Tissue was then filtered through a 70-um filter. For 
brain tissues, cells were mixed in 4 ml of 25% Percoll (Sigma-Aldrich) 
solution and centrifuged at 530g for 15 min without a brake. The Per- 
coll layer was removed and cells were diluted in 5 ml of 1% BSA. Cells 
were treated with ACK buffer and resuspended in 1% BSA. At this point, 
cells were counted using an automated cell counter (Thermo Fisher 
Scientific). 

For tetramer experiments, staining was performed with antibod- 
ies (1:200) and tetramer (1:50) for 60 min at room temperature. Cells 
were washed to remove excess antibodies and resuspended in 1% BSA 
with 10 pl of CountBright absolute counting beads (Life Technologies) 
for multiparameter analyses on the LSRII flow cytometer (Becton Dick- 
inson), and then analysed using FlowJo software (10.5.3, Tree Star). 
For calculation of tetramer-positive T cells in each organ the following 
formula was used: number of tetramer-positive T cells x (no. of input 
beads/no. of counted beads) x (no. of cells from automated counter/ 
no. of total events in flow cytometry). 

For cytokine stimulation, surface markers were first stained on ice 
for 30 min. After washing, cells were stimulated in complete RPMI 
with 200 pl of 1x eBioscience Cell Stimulation Cocktail (Thermo Fisher 
Scientific) without protein transporter inhibitor for 1h at 37 °C. Fifty 
microlitres of 5x eBioscience Cell Stimulation Cocktail with protein 
transporter (Thermo Fisher Scientific) was added and incubated 
for an additional 4 h. Cells were then fixed with 100 pl 2% formalde- 
hyde on ice for 45 min. Cells were washed with 1x Perm/Wash Buffer 
(BD Cytofix/Cytoperm, BD Biosciences), and then permeabilized with 
1x Perm/Wash Buffer (BD Cytofix/Cytoperm, BD Biosciences) for 10 min 
onice. Intracellular antigens were stained on ice for 30 min. 

For transcription factor staining, surface markers were first stained 
onice for 30 min. Cells were then fixed with 100 pl 2% formaldehyde on 
ice for 45 min. Cells were washed with 1x Perm/Wash Buffer (eBiosci- 
ence FOXP3/Transcription Factor Staining Buffer Set, Thermo Fisher 
Scientific), and then permeabilized with 1x FOXP3 Perm/Wash Buffer 
for 10 min onice. Intracellular antigens were stained on ice for 30 min. 


For AKT phosphorylation staining, surface markers were first stained 
onice for 30 min. Cells were then fixed and washed following the direc- 
tions of the BD Phosflow kit. Samples were run on an Attune NxT flow 
cytometer. 


Enzyme-linked immunosorbent assay 

Enzyme-linked immunosorbent assay (ELISA) was performed using 
a mouse VEGF-C ELISA Kit (Cusabio LLC, E07361m-96) following the 
manufacturer’s instructions. 


mRNA tropism 

Mice were injected in the cisterna magna with Cy5-labelled GPP MRNA 
with JETPEI. A day (24 h) after injection, brains, meninges and lymph 
nodes were collected for flow cytometry. Samples were run onan Attune 
NxT flow cytometer. 


Blood-brain barrier permeability 

Mice were injected intravenously with fluorescein-labelled dextran 
(molecular weight, 70,000 kDa) (Thermo Fisher Scientific) or 0.5% 
Evans Blue. For microscopy, brains were collected 2 h later and flash- 
frozen for sectioning. For Evans Blue quantification, mice were per- 
fused with ice-cold PBS intraventricularly (heart) and Evans Blue was 
extracted using dimethylformamide. The relative absorbance was 
measured using a SpectraMax i3 platform (Molecular Devices) at 
620-nm wavelength. 


T cell proliferation 

T cells were isolated using the EasySep Mouse T cell Isolation Kit 
(StemCell Technologies). T cells were stained using CellTrace Violet, 
and stimulated with CD3/CD28 antibodies from Bio X Cell for 24 hin 
complete RPMI. After 24 h, cells were resuspended in medium contain- 
ing IL-2 and VEGF-C for 4 days and FACS was performed at the end of 
the 4 days. 


Culture and stimulation of bone-marrow-derived dendritic cells 
Bone marrowwas collected from wild-type mice and cultured in com- 
plete RPMI supplemented with granulocyte-macrophage colony-stim- 
ulating factor for six days. After 6 days, cells were stimulated with LPS 
and VEGF-C for 24 h and FACS was performed. 


Statistical analysis 

For revision experiments, power analysis was completed to estimate 
sample size. Asummary of key phenotypes that we observed through- 
out multiple experiments were graphed. These experiments showed 
response rates of around 70%, and the sample size calculation required 
for an a value of 0.05 was n = 6 (power 95%) and n=5 (power 90%). 

For alltumour-related experiments with luciferase constructs, mice 
were measured for tumour size at day 4-6, and randomized into groups 
to ensure similar tumour size. For experiments in which mice were 
pooled, those mice within each pool were also ensured to have tumours 
of asimilar size to minimize variation in downstream analysis. 

Mice that underwent treatment were randomized after the measure- 
ment of tumour size at day seven. Investigators were not blinded for 
the majority of the study; however, all survival studies were monitored 
with the help of veterinarians from the Yale animal facility who were 
blinded to the studies and reported endpoints accordingly. In addition, 


animals were in mixed cages and labelled with numbers that blinded 
allinvolved with scoring mice health. 

Survival curves were analysed using a log-rank (Mantel-Cox) test. 
For other data, normally distributed continuous variable comparisons 
used a two-tailed unpaired Student’s t-test or paired Student’s t-test 
with Prism software. 


Graphical illustrations 
Graphical illustrations were made using BioRender (https://biorender. 
com/). 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


No new sequencing data were generated for this study. All datasets 
generated and/or analysed during the current study are available in the 
Article, the Source Data files that accompany Figs. 1-4 and Extended 
Data Fig. 1-10, or the Supplementary Information. 


Code availability 


All of the code used for analysis is described in the methods. Detailed 
files willbe made available from the corresponding authors on request. 
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Extended Data Fig. 1| Increased meningeal lymphatic vasculature confers 
protection against intracranial glioblastoma challenge and provides 
long-term immunity without perturbance of the blood-brain barrier. 

a,b, Mice inoculated with 50,000 GL261-Luc cells were imaged every 7 days 
and showed consistent and reliable tumour growth (n=4).c, GL261-Luc cells 
result in lethality in mice in acell-number-dependent manner (500 cells, n=5 
mice; 5,000 cells, n=5 mice; 50,000 cells, n=9 mice). d, Mice were injected 
intravenously with dextran-fluorescein (molecular weight, 70,000 kDa) and 
euthanized after 2 h. Brains were collected and cryosectioned (n=4) The 
experiment was repeated independently with similar results. e, Mice were 
injected intravenously with 0.5% Evans Blue. After 2h mice were perfused 
intraventricularly and Evans Blue was extracted from brain tissue using 
dimethylformamide (wild type, LPS, AAV-VEGF-C, VEGFC mRNA, n=4; tumour, 
tumour + VEGFC mRNA, n=5). BBB, blood-brain barrier. f, Representative 
images of AAV-CTRL and AAV-VEGF-C-treated mice after implantation of 5,000 
cells. The experiment was repeated independently with similar results. 

g, Monitoring of the long-term survival of mice after AAV-VEGF-C and AAV- 
CTRL injections into the cisterna magna (n=5).h, i, CS7BL/6 mice received an 
injection of AAV-CTRL or AAV-VEGF-C through the cisterna magna. Six to eight 
weeks later, mice were euthanized and the dura was collected to image the 
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lymphatic vasculature (LYVE1*) in the superior sagittal sinus (AAV-CTRL, n=4; 
AAV-VEGF-C, n=5).j, CS7BL/6 mice that had been injected with CTRL-AAV or 
AAV-VEGF-C two months previously were implanted with 50,000 GL261-Luc 
cells inthe striatum and monitored for survival (AAV-CTRL, n= 4; AAV-VEGF-C, 
n=5).k, AAV-CTRL- or AAV-VEGF-C-treated mice were depleted of CD4 or CD8& 
Tcells using anti-CD4 (GK1.5) or anti-CD8 (YTS169.4) antibodies starting one 
day before tumour inoculation (GL261) and redosed every four days afterwards 
(AAV-CTRL, n=4; AAV-VEGF-C, n=5; AAV-VEGF-C + anti-CD8, n=4; AAV- 
VEGF-C +anti-CD4,n=5).1, pMT B-cell-deficient mice were injected with AAV- 
CTRL or AAV-VEGF-C and challenged with 50,000 GL261-Luc cells two months 
afterwards (AAV-CTRL, n=5; tMT AAV-CTRL, n=3; LMT AAV-VEGF-C, n=5). 

m, Top, schematic of the schedule of procedures for the experiments below 
(bottom panel) and in Fig. 1f. Mice injected with AAV-CTRL or AAV-VEGF-C that 
survived over 100 days after challenge with 5,000 GL261-Luc cells were 
rechallenged with 500,000 GL261-Luc cells in the flank. Bottom, IVIS imaging 
of mice ten days after flank rechallenge. Data are pooled from two independent 
experiments (h-m) and are meants.d.*P<0.05;**P<0.01;***P< 0.001; 

****P < 0.0001 (two-tailed unpaired Student’s t-test or two-sided log-rank 
Mantel-Cox test). 
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Extended Data Fig. 2| Correlation of VEGF-C expression profiles between 
human and mouse glioblastoma. a-c, RNA-seq data of tumour tissue and 
healthy brain tissue from different regions (TCGA, study accession phsO00178. 
v10.p8, and GTEX, study accession phs000424.v7.p2, respectively. 

a, b, Expression profiles of VEGF-A (a) and VEGF-C (b) (glioblastoma (GBM), 
n=147; cortex, n=133; amygdala, n= 81; Brodmann areas 24 and 9,n=215; C1 
segment, n=75; caudate nucleus, n=135; cerebellar hemisphere, n=115; 
cerebellum, n=146; hippocampus, n=103; hypothalamus, n=101; nucleus 


accumbens, n=125; putamen, n=103; substantia nigra, n=72; tibial nerve, 
n=329).c, RNA-seq data of mouse healthy brain and GL261 tumours were 
analysed (n=3 biologically independent samples). d, ONCLNC (www.oncolnc. 
org) data of patients with glioblastoma stratified into two groups (VEGF-C low, 
lower 33%; VEGF-C high, upper 33%; n= 50). e, Kaplan-Meier survival curves of 
patients fromd (n=50). Dataare mean+s.d.*P<0.05;**P<0.01;***P< 0.001; 
****P < 0.0001 (two-tailed unpaired Student’s ¢-test, two-sided log-rank 
Mantel-Cox test or Pearson’s correlation). 
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Extended Data Fig. 3| Validation of VEGFC mRNA in vitro, andits in vivo 
uptake and expression tropism. a, The complementary DNA (cDNA) sequence 
of VEGFCthat was used for mRNA production. b, 5-Methylcytosine and 
pseudouridine were used to substitute all of the cytosine and uridine bases in 
the mRNA. c, The VEGFCmRNA construct was transfected into HEK293T cells 
and cell lysate and the medium was collected to detect full-length and secreted 
intermediate and processed peptides. d, The VEGFCmRNA construct (5 11g) 
was delivered in vivo into the cisterna magna of mice using aJETPEI polyplex 
system. Six hours later, mouse CSF was collected using a capillary tube and 
filtered with an amicon filter, and the wash-through was used toruna western 
blot (each lane is n=10 mice pooled). e, Raw western blot images for Extended 
Data Fig. 4c (gel 1: column1, control; column 2, CyS-GFP mRNA; column 3, 
VEGFC mRNA; column 4, control; column 5, CyS-GFP mRNA; column 6, 

VEGFC mRNA; gel 2: column 1, control; column 2, CyS-GFP mRNA; column 3, 
VEGFC mRNA) and Extended Data Fig. 4d (gel 3: columns 1-3, Cys-GFPmRNA; 
columns 4-6, CyS-GFP mRNA; columns 7-9, recombinant human VEGF-Cin 
increasing concentration). The experiments were repeated twice 
independently with similar results. f, VEGFC mRNA and CyS5-labelled GFP MRNA 


& 


were mixed at a1:1 ratio and delivered to mice in vivo using JETPEI. Fifteen 
minutes later, mice were euthanized and the whole skull cap was imaged to 
observe the distribution of mRNA particles. The experiments were repeated 
independently twice with similar results. g,h, VEGFC mRNA and CyS-labelled 
GFP mRNA were mixed at a1:1 ratio and delivered to mice in vivo using JETPEI. 
After 24h, the brains, meninges and lymph nodes of treated mice were 
collected for flow cytometry to measure the percentage of Cy5-positive cells in 
each compartment (control, n= 6; Cy5-labelled mRNA, n= 9; data are pooled 
from two independent experiments). i, The CSF, meninges, brainand serum 
were collected after two months (AAV-CTRL, AAV-VEGF-C), after 24h 

(GFP mRNA, VEGFC mRNA) or at days 7 and 28 after tumour inoculation, and an 
ELISA was performed to detect VEGF-C (CSF: AAV-VEGF-C, n= 6; other groups, 
n=3;5 mice were pooled for each sample; meninges: AAV-VEGF-C, n= 6; 7-day 
tumour, n=3; other groups, n=5; brain: AAV-CTRL, GFP MRNA, n= 6; AAV- 
VEGF-C, VEGFCmRNA, n=5; 7-day tumour, n=3; 28-day tumour, n=7; serum: 
n=3).Dataare mean +s.d.*P< 0.05; **P< 0.01; ***P< 0.001; ****P< 0.0001 (two- 
tailed unpaired Student’s t-test). 


{ Single cell 1 b Baseline: MFI 1064, 9.82% Meninges 


ey AAV-VEGF-C| '*° tumor + Luc-mRNA] "4 tumor + VEGF-C-mRNA 


Single cell 1 Single cell 2 


60 __ 12.6% 60: 


oli n0728 LEC] , 
-10' 0 10" 10° 10 0 10° 10° 


Baseline: MFI 313, 0.96% 


elie Mihi | base | AAV-VEGF-C| '” turpor + Luc-mRNA| “4 tumor +VEGF-C-mRNA 
[+ wo 0 ey 
= a 2.63% | w 1.40% | a 2.86% 
i 1 1 40+} “0 40 
ao 2 
- Mel 468 BEC ‘s Me 489 BEC ‘ IML 472 BEC. 
CD45- 2 


Single cell 2 CD45- 
2 | LEC | : 
1 oF Toe oF 1 
j d q a“ 
CD45+ 5 AKT-phos-BV421 
? o ] o =@® Soot mee Baseline: MFI 563, 2.27% Lymph node 
zr4 A ¥f zm BEC zm ia wa] AAV-VEGF-C] tumor + Luc-mRNA] 7 tumor +)VEGF-C-mRNA 
3} a a oof ° » 
a a a a . 9, 
D | peer perrrrerer rare rr [ruil Uanaaeeheauneeenaannaaaad a _ 452% | 0.82% | 4.04% 
CD45-BV605 CD31-A647 CD31-A647 *4 * * 
2 2» 2» 
IMEL 104] IML 563 LEC 0 IMEL 663 LEC 
Cc Meninges LEC Meninges LEC Meninges BEC Meninges BEC i “<r eT eran 
zs « P=0.03 ns_P=0.058 
Se 80- ae 5000 5 2 80 5000- Baseline: MFI 349, 4.59% 
25 60. 5 4000 raya 4000: 0] AAV-VEGF-C] tumor + Luc-mRNA] “{ tumor 4VEGF-C-mRNA 
ca 3.2x 5 2.1x ° S o 60: 
_ g S00 0 %0 0 
8 40. 24x E sone 16x SS 40. g é 1.65% e 2.34% ‘ 2.04% 
ES 2000 es 2000: ak = == 
s3 20- + a 1000 | ri 38 20 4000. : : i 
& ELLIE ot, tL, © olx se ol tA {MEI 337 BEC]  ,jmei333 BEC] , MeL 341 BEC 
LN LEC LN LEC LN BEC LN BEC fener Tor oF oe canna oe toe Toe 
80 5000. Za S000: AKT-phos-BV421 
Ao pas 
Ge oo] ___+ Peo0z 4000 peooor Se poy d tumor tumor 
. Ps 
BE a P=0.008 gg 3000) —— Be 40. iz 3000 Luc-mRNA VEGF-C-mRNA 
ifs) * p-00007 = 2000 P= 0.08 68 = 2000. 
= <i 
s3 20) 2x oa 7x 1000: a Wax ee 77 |e , 1000 
S obey oe OO et olf ll 8 Pa de a 
2 FP oe L gt LF & 2 a 
© S&é “SE ra “S é&€é s “S es 
oe g& et oe EF wo 
Fee? ¥ se Fe ¥ oe 
x es & x & & 
Ss ws ws Ss 
q s 
s < Ss se cq 
~Z 
f : 
% 100 Nox yy 58 
2 -d60 AAV — -d60 AAV a 
ec 80 == dO AAV 
2 dd AAV — d7 AAV s 
60 se dOmRNA 
<c d7 AAV — d3mRNA 5 & 
S 40 dO mRNA) — 47 mena § 9 
= oO 
® 90 d3 mRNA w 
a d7 mRNA 


Oo 


@L269-La@9 30 40 50 60 
Days elapsed 


= 


g d7 di4_ d21_ 28 


0 0 
0 5 10 15 20 25 30 0 5 10 15 20 25 30 
Days Days 


11084 -460 AAV 4105 0 AAV 


10 15 20 25 30 


Days 


: dO mRNA 


0 Z ol: send 
0 5 10 15 20 25 30 0 5 10 15 20 25 30 
Days Days 


64 3! 64 d7 mRNA 
110% d3/mRNA 7 1105 ~ ae 


ol 0 - 
0 5 10 15 20 25 30 0 5 10 15 20 25 30 
Days Days 


Extended Data Fig. 4 | See next page for caption. 
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Extended Data Fig. 4 | VEGF-C signals specifically in lymphatic endothelial 
cells inthe meninges and deep cervical lymph nodes and provides survival 
benefits in a manner that depends on the administration time point. 

a, Gating strategy for LECs and BECs. b, Concatenated FACS plots of LECs and 
BECs from meninges and lymph nodes, depicting AKT phosphorylation 
intensity. The experiment was repeated independently with similar results. 
c, Quantification of the AKT(pS473)-positive population and mean 
fluorescence intensity (MFI) within LECs and BECs in the meninges and 

deep cervical lymph nodes (meninges: wild type, n=5; AAV-VEGF-C, 

tumour + Luc mRNA, tumour + VEGFC mRNA, n= 8; lymph nodes: wild type, 

n=5; AAV-VEGF-C, n=8; tumour + Luc mRNA, n=7; tumour + VEGFC mRNA, 


n=8).d, Fluorescence microscopy images of deep cervical lymph nodes after 
treatment with VEGFC mRNA in tumour-bearing mice (CD31, red; LYVE1, green; 
DAPI, blue). e, Fluorescence microscopy images of meninges after treatment 
with VEGFC mRNA in tumour-bearing mice (CD31, red; LYVE1, green; DAPI, 
blue). The experiment was repeated independently with similar results. 

f-h, Mice were treated with AAV-VEGF-C or VEGFCmRNAat different time 
points relative to GL261-Luc tumour inoculation (day 0). Tumour growth 
kinetics (g, h) and survival (f) were monitored (n=5 for all groups, notx refers 
to mice receiving no treatment). Data are mean +s.d.*P< 0.05; **P< 0.01; 
***P < 0.001; ****P< 0.0001 (two-tailed unpaired Student’s t-test or two-sided 
log-rank Mantel-Cox test). 
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Extended Data Fig. 5 | Therapeutic delivery of VEGF-C potentiates 
checkpoint inhibitor therapy even at late stages of tumour development. 

a, Related to Fig. 3c, d. Quantification of the number of cells per tumour- 
bearing brain hemisphere or lymph node using CountBright absolute counting 
beads and autocounter (see Methods for details) (lymph nodes: GFP mRNA, 
n=6; VEGFCmRNA, n=12; tumour: GFP MRNA, n=3; VEGFCMRNA, n=5). 

b, Mice that rejected tumours after combination therapy with VEGFC mRNA 
and anti-PD-1 (RMP1-14) were rechallenged in the contralateral hemisphere and 
observed for survival (naive, n=5; day 100 rejected, n=4).c, T cells fromlymph 
nodes and spleens from mice that rejected tumours after combination therapy 
with VEGFC mRNA and anti-PD-1 (RMP1-14) or naive wild-type mice were 
isolated and transferred into naive wild-type mice intravenously. After 24h, 
GL261 tumours were inoculated intracranially and the mice were observed for 
survival (wild type, n=5; wild type with transfer of naive T cells, n=5; wildtype 
with transfer of memory T cells, n=7).d, Mice inoculated with 50,000 GL261- 
Luc cells were treated with VEGFC mRNA or GFP mRNA (day 7) and with either 
anti-PD-1(RMP1-14) antibodies or isotype antibodies (days 7, 9 and 11), and 
monitored for survival. Mice were depleted of CD4 or CD8 T cells using anti- 
CD4 (GK1.5) or anti-CD8 (YTS169.4) antibodies starting one day before tumour 
inoculation and redosed every four days afterwards (VEGFC mRNA + anti-PD-1, 
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n=6; GFP mRNA + anti-PD-1, n= 6; VEGFC mRNA + anti-PD-1+ anti-CD4, n=5; 
VEGFC mRNA + anti-PD-1+ anti-CD8,n=5).e, Schematic of experimental design 
for the results shown in fand g. f, g, Mice inoculated with 50,000 CT-2A-BFP 
cells (f) or CT-2A cells (g) were treated with VEGFC mRNA or GFP mRNA (day 7) 
and either with PBS or with anti-PD-1(RMP1-14) and/or anti-4-1BB (LOB12.3) 
antibodies (days 7, 9 and 11), and monitored for survival (f, VEGFC mRNA + anti- 
PD-1+ anti-4-1BB, n=5; GFP mRNA + anti-PD-1+ anti-4-1BB, n=5; VEGFC mRNA + 
PBS,n=4; GFPmRNA + PBS, n= 6;g,n=5 forall groups except VEGFC mRNA+ 
anti-4-1BB + anti-PD-1, n=7).h-j, Mice inoculated with 50,000 GL261 cells were 
treated with VEGFC mRNA or GFP mRNA (day 7) and either with PBS or with anti- 
PD-1(RMP1-14) (h), anti-TIM3 (RMT3-23) (i) or anti-CTLA4 (9H10) antibodies (j) 
(days 7, 9 and 11), and monitored for survival (n=5). Foriandj, the same control 
mice were used for the GFP mRNA + PBS and VEGFCmRNA + PBS groups. 

k, Schematic of experimental design for the results shown inI.1, Mice 
inoculated with 50,000 GL261-Luc cells were treated with VEGFC mRNA or GFP 
mRNA (day 20) and either with PBS or with anti-PD-1 (RMP1-14) and anti-TIM3 
(RMT3-23) antibodies (days 20, 22 and 24), and monitored for survival (n=5). 
*P<0.05;**P< 0.01; ***P< 0.001; ****P< 0.0001 (two-tailed unpaired Student’s 
t-test or two-sided log-rank Mantel-Cox test). 
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Extended Data Fig. 6 | Validation of endogenous retrovirus EMV2asa 
tumour antigen for GL261. a, RNA-seq analysis of mouse endogenous 
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d, Mice were injected with 500,000 GL261 cells or PBS in the flank. Seven days 
after tumour inoculation, draining inguinal lymph nodes were collected and 


retrovirus elements in publicly available datasets on C57BL/6J mice brains and emv2-env (K°-restricted peptides aa 604-611 of p15E protein (KSPWFTTL)) 


GL261 cell lines from various sources. b, Quantification of endogenous 
retrovirus elements in healthy brain and tumour tissue, from RNA-seq 
(brain, n=9; tumour, n=6).c, Gating strategies for tetramer staining. 


tetramers were used to validate tumour-specific T cell proliferation. The 
experiments were repeated twice independently with similar results. Data are 
mean +s.e.m. 
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Extended Data Fig. 7 | See next page for caption. 
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Extended Data Fig. 7 | VEGF-C-dependent anti-PD-1 potentiation is specific 
to VEGF-C among proteins of the VEGF family, and is not caused by a direct 
effect ontumour or immune cells. a, b, C57BL/6 mice received an injection of 
AAV-CTRL or AAV-s(oluble) VEGFR-3 intracisternally through the cisterna 
magna. After four weeks, mice were euthanized and the dura mater was 
collected to image the lymphatic vasculature (LY VE1) in the confluence of 
sinuses (a). The relative area of lymphatic vasculature in the confluence of 
sinuses was quantified (b) (n=5).c, Mice were pretreated with AAV-sVEGFR-3 
four to six weeks before tumour inoculation. Seven days after tumour 
inoculation, mice were treated with VEGFC mRNA and anti-PD-1 (RMP1-14) 
antibodies (days 7, 9 and 11) (n=5). d-f, Mice were treated with5 pg of 
recombinant protein (VEGF-A, VEGF-B, VEGF-Cs or VEGF-D) in combination 
with anti-PD-1(RMP1-14) antibodies (days 7, 9 and 11) and monitored for survival 
and tumour growth (n=5). g-k Mice were injected with CT-2A-BFP tumours 
and were treated with VEGFC mRNA at day 7. On day 8, brains and lymph nodes 
from all mice were collected and analysed using flowcytometry. 


The experiment was repeated independently with similar results. g, Sample 
flow cytometry plots of experiments. h-k, Quantification of experiments 
(n=5).1, Flowcytometry was used to evaluate the expression of VEGFR-3in 
GL261 cells. A VEGFR3-GFP plasmid was transfected into HEK293T cells asa 
positive control. The experiment was repeated independently with similar 
results.m, MTT assay to measure the proliferation of GL261 cancer cells inthe 
presence of VEGF-C after 48 h (n=8 per group).n, Flowcytometry was used to 
evaluate the expression of VEGFR-3 in leukocyte compartments in the tumour. 
The experiment was repeated independently with similar results. 0, Bone- 
marrow-derived dendritic cells were cultured with VEGF-C and evaluated for 
the expression of costimulatory molecules in the naive state (top row) or with 
LPS stimulation (bottom row). p, Isolated T cells were activated in vitro with 
CD3 or CD28 and IL-2 in the presence of VEGF-C. Data are meants.d.*P<0.05; 
**P<0.01;***P< 0.001; ****P< 0.0001 (two-tailed unpaired Student’s f-test or 
two-sided log-rank Mantel-Cox test). 
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Extended Data Fig. 8 | Flowcytometry analysis of myeloid cell populations dendritic cell.e-g, MHC II and CD80 MFI levels were quantified and showed no 
after treatment with VEGF-C. Mice bearing 7-day tumours were treated with significant alteration after treatment with VEGFC mRNA. band eare leukocytes 
Luc mRNA or VEGFC mRNA and evaluated for changes in the populations of from brain tissue; cand f are leukocytes from draining cervical lymph nodes; 


myeloid cells.a, Gating strategy for different myeloid cells. b-d, Cell counts of dand gare leukocytes from meninges (n=3, 3 mice were pooled for each 
different cell types were measured at different time points after VEGFC MRNA replicate). Data are mean+s.e.m *P< 0.05; **P< 0.01; ***P< 0.001; ****P< 0.0001 
treatment. cDC, classical dendritic cell; DC, dendritic cell; pDC, plasmacytoid (two-tailed unpaired Student’s ¢-test). 
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Extended Data Fig. 9 | VEGF-C treatment changes T cell phenotypes and 
functionality. Mice bearing 7-day-tumours were treated with Luc mRNA or 
VEGFC mRNA and evaluated for changes. a, Mouse brains were collected two 
days after treatment with VEGFC mRNA, cryosectioned and analysed by 
immunofluorescence microscopy. b, Gating strategy for flow cytometry 
analysis of T cells. c, Example of TCF7 staining in CD3*CD8*CD44" cell 
populations after VEGFC mRNA treatment. The experiment was repeated twice 
independently with similar results. d, Percentage of TCF7’ T cells inthe 
CD3*CD8*CD44* population in the brain (Luc mRNA, n=14; VEGFCmMRNA, n=9; 
data pooled from 3 independent experiments). e, Number of CD3*CD8*CD44* 


cells producing IFNy, TNF, IL-2 and GZMB inthe brain (n=3; 3 mice were pooled 
for eachsample). Violin plots display quartiles (dotted horizontal lines), 
median (dashed line) and minimum and maximum values (solid lines). 


f, Quantification of cell counts in different compartments after treatment with 


VEGFC mRNA. Percentage of cells expressing specific transcription factors or 
immune checkpoint inhibitors after treatment with VEGFC mRNA. g, Gating 
strategy for cytokine productioninT cells. h,i, Quantification of T cells 
expressing multiple cytokines (n =3, 3 mice were pooled for each replicate). 
Data are mean ts.e.m. *P< 0.05; **P<0.01;***P< 0.001; ****P< 0.0001 
(two-tailed unpaired Student’s t-test). 
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Extended Data Fig. 10 | Therapeutic application of VEGF-C mediates 
protection against intracranial melanomacell lines when combined with 
various checkpoint inhibitors. a, Schematic of experimental design for 

Fig. 4a—c and b-e. b, Growth kinetics of flank tumours from Fig. 4a-c were 
measured using acaliper (n=12 for all groups except the ligation group, n=7). 
c-e, Mice were given either only B16 intracranial tumours (IC) (c) or botha B16 


intracranial tumour and aB16 flank tumour (FT) (d) and treated with GFP MRNA 
or VEGFC mRNA on day 7, and anti-PD-1 (RMP1-14), anti-CTLA4 (9H10) and anti- 
TIM3 (RMT3-23) on days 7, 9 and 11. f, Flow cytometry gating strategy for Fig. 4e, 
f. g, Schematic of VEGF-C-induced tumour rejection. *P< 0.05; **P< 0.01; 

***P< 0.001; ****P < 0.0001 (two-sided log-rank Mantel-Cox test). 


nat | J Te research Corresponding author(s): Akiko Iwasaki, Jean-Leon Thomas 


Last updated by author(s): November 6 2019 


Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 


Statistics 
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Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


x Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size We graphed a summary of key phenotypes we observed throughout multiple experiments. With these two experiments showing ~70% 
response rates, the sample size calculation required for an alpha of 0.05 is N = 6 (power 95%) and N = 5 (power 90%). 


Data exclusions Data was only excluded when experiment collection machine failed and no data points were collected. as were not able to produce readable 
result files. 


Replication All experiments were repeated as a variation (changes in time points) or in smaller n numbers, and showed similar trends and experimental 
conclusions. For most experiments depicted, they are concatenated examples of all experiments performed (not true for experiments where 
variation between different days are higher, ie. MFI measurements). All experiments were reproducible. 


Randomization For all tumor-related experiments with luciferase constructs, mice were measured for tumor size at day 4-6, and randomized into groups to 
ensure similar tumor size. For experiments where animals were pooled, animals within each pool were also ensured to have similar size 
tumors to minimize variation in downstream analysis. 


Blinding Investigators were not blinded for majority of the study; however all survival studies were monitored with the help of Yale animal facility Vets 
who were blinded to the studies. In addition, animals were mixed and labeled with numbers that blinded all involved with scoring mice health. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used Anti-CD45 (30-F11, APC-Cy7, B266564, 1:100; 104, BUV737, 9051755, 1:100; 104, PE-Cy7, B268066, 1:100; A20, APC, B254042, 
1:100; 30-F11, BV605, B278000, 1:100), anti-CD3 (17A2, BV605, B264993, 1:100; 145-2C11, APC-Cy7, 1:100; 17A2, BUV737, 
9042537, 1:100; 17A2, Biotin, B259691, 1:100), anti-CD4 (GK1.5, Pacific Blue, B199050, 1:100; GK1.5, BUV496, 8080653, 1:100), 
anti-CD8 (53-6.7, BV711, B259953, 1:100; 53-6.7, BUV395, 8306672, 1:100), anti-IFNy (XMG1.2, BV711, B236526, 1:100), anti- 
GZMB (GB11, FITC, B275568, 1:100), anti-TNFa (MP6-XT22, PE-Cy7, B251190, 1:100), anti-IL2 (JES6-5H4, APC, B248053, 1:100), 
anti-CD44 (IM7, A700, B244378, 1:100), anti-TBET (4B10, BV711, B268785, 1:100), anti-TIM3 (RMT3-23, BV605, B262042, 
1:100), anti-FOXP3 (FJK-16s, BV421, B266620, 1:100), anti-TCF7 (C63D9, A488, 8, Cell Signaling Technology, 1:100), anti-PD-1 
(29F.1A12, APC-Cy7, B260172, 1:100), anti-RORyT (B2D, APC, E16663-102, 1:100), anti-CD40 (PE, E028955, 1:100), anti-CD80 
(16-10A1, BUV395, 1:100; 16-10A1, FITC, E029730, 1:100), anti-CD86 (APC, B175381, 1:100), anti-CX3CR1(SA011F11, BV421, 
B231871, 1:100), anti-Ly6C (AL21, FITC, 33380, 1:100), anti-CD11c (N418, PE-Cy7, B264758, 1:100; BV421, B264454, 1:100), anti- 
CD11b (M1/70, PE, B228654, 1:100), anti-Ly6G (IA8, APC-Cy7, B153128, 1:100), anti-MHCII (M5/114.15.2, A700, B264454, 
1:200), anti-CD64 (X54-5/7.1, APC, B254424, 1:100), anti-B220 (RA3-6B2, BUV496, 8096734, 1:100), anti-NK1.1 (PK136, biotin, 
B255213, 1:100), anti-CD19 (6D5, biotin, B250292, 1:100), anti-F4/80 (BMB8, biotin, B253458, 1:100), anti-Podoplanin (eBio8.1.1, 
PE, E11344-399, eBioscience, 1:100), anti-CD31 (390, A647, 8187629, 1:100), anti-AKT (pS473) (M89-61, BV421, 7198801, 1:100) 
antibodies were purchased from BioLegend (Bxxxxxx) or BD Biosciences (xxxxxxx). anti-VEGFR-3/FLT-4 (FAB743P, PE, 
ACBF0117091, 1:100) antibodies were purchased from R&D biosciences. Kb-restricted peptides aa 604-611 of p15E protein 
(KSPWFTTL) tetramer was made through the NIH tetramer core facility. KSPWFTTL peptide was made by Biomatik Corporation 
(Ontario, CA). Depletion antibodies anti-CD4 (GK1.5), anti-CD8 (YTS169.4), anti-PD1 (RMP1-14), anti-CTLA4 (9H10), anti-TIM3 
(RMT3-23), anti-4-1BB (LOB12.3) antibodies were purchased from BioXCell (West Lebanon, NH). For brain sections, anti-CD3 
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(17A2, biotin, 1:500), anti-CD31 (2H8, GeneTex, 1:500) and anti-LYVE1 (AF2125, R&D systems, 1:250) antibodies were used with 
anti-streptavidin (FITC, BD biosciences, 4031801, 1:1000), anti-hamster (127-165-160, Cy3, Jackson ImmunoResearch, 128827, 
1:500), and anti-goat (705-175-147, Cy5, Jackson ImmunoResearch , 138513, 1:500) secondary antibodies respectively. 


Validation antibodies were all validated on their respective websites, and all are antibodies regularly used in our lab for other purposes. 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) GL261 parental cells were obtained from the NIH cancer cell repository. GL261-Luciferase cells were kind gifts from Dr. 
Jiangbing Zhou (Yale Neurosurgery) and Dr. Carla Rothlin (Yale Immunobiology). YUMMER 1.7 cell lines are previously 
reported50. CT2A and CT2A-BFP cells were a kind gift from Dr. Thomas Mathivet (PARCC, Paris). B16 cells were a kind gift 
from Dr. Noah Palm (Yale Immunobiology). HEK293T cells were purchased from ATCC. HEK293T, CT2A, CT2A-BFP and B16 
cells were cultured in complete DMEM (4.5g/L glucose, 10% FBS, 1% penicillin/streptomycin). YUMMER 1.7 cells were 
cultured in DMEM/F12 media (10% FBS, 1% nonessential amino acids, 1% penicillin/streptomycin). GL261 and GL261-Luc 
cells were cultured in RPMI (10% FBS, 1% penicillin/streptomycin). 


Authentication No authentication was performed. 
Mycoplasma contamination All cells were mycoplasma negative 


Commonly misidentified lines No commonly misidentified lines were used, and all cells were obtained from trusted sources (in material methods). 
(See ICLAC register) 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Four- to eight-week-old mixed sex C57BL/6 mice, B6.129S2-IghtmICgn/J (UMT) mice, B6.SJL-PtprcaPepcb/BoyCrl mice, were 
purchased from the National Cancer Institute, Jackson Laboratory and Charles River respectively and subsequently bred and 
housed at Yale University. All procedures used in this study (sex-matched, age-matched) complied with federal guidelines and 
institutional policies by the Yale School of Medicine Animal Care and Use Committee. 


Wild animals no wild animals were used 
Field-collected samples No field-collected samples were used. 
Ethics oversight All procedures used in this study (sex-matched, age-matched) complied with federal guidelines and institutional policies by the 


Yale School of Medicine Animal Care and Use Committee. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Clinical data 


Policy information about clinical studies 
All manuscripts should comply with the ICMJE guidelines for publication of clinical research and a completed CONSORT checklist must be included with all submissions. 


Clinical trial registration No clinical studies were performed 

Study protocol Note where the full trial protocol can be accessed OR if not available, explain why. 

Data collection Describe the settings and locales of data collection, noting the time periods of recruitment and data collection. 
Outcomes Describe how you pre-defined primary and secondary outcome measures and how you assessed these measures. 
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Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 
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Methodology 


Sample preparation Isolation of mononuclear cells and flow cytometry. Tissue was harvested and incubated in a digestion cocktail containing 1 mg ml 


-1 collagenase D (Roche), 1 mg ml-1 collagenase A (Roche) and 30 ug ml-1 DNase | (Sigma-Aldrich) in complete RPMI (10% FBS) 
at 37 °C for 30 min. Tissue was then filtered through a 70 um filter. For brain tissues, cells were mixed in 4 mL of 25% Percoll 
(Sigma-Aldrich) solution and centrifuged at 530g for 15 minutes without a brake. The Percoll layer was removed and cells were 
diluted in 5 mL of 1% BSA. Cells were treated with ACK buffer, and resuspended in 1% BSA. At this point cells were counted using 
an automated cell counter (Thermo fisher). 

For tetramer experiments, staining was performed with antibodies (1:200) and tetramer (1:50) for 60 minutes at room 
temperature. Cells were washed to remove excess antibodies and resuspended in 1% BSA with 10 pL of CountBright absolute 
counting beads (Life technologies, OR) for multiparameter analyses on the LSR II flow cytometer (Becton Dickinson), and 
subsequently analyzed using FlowJo software (10.5.3, Tree Star). For calculation of tetramer positive T cells in each organ this 
calculation was used: 

number of tetramer positive T cells * (# of input beads / # of counted beads) * (# of cells from automated counter / # of total 
events in flow cytometry). 

For cytokine stimulation, surface markers were first stained on ice for 30 minutes. After washing, cells were stimulated in 
complete RPMI with 200uUL of 1x Cell stimulation cocktail without protein transporter inhibitor (eBioscience Cell Stimulation 
Cocktail, ThermoFisher) for 1 hour at 37C. 50 uL of 5x Cell stimulation cocktail with protein transporter (eBioscience Cell 
Stimulation Cocktail, ThermoFisher) was added and incubated for an additional 4 hours. Cells were then fixed with 100uL 2% 
formaldehyde on ice for 45 minutes. Cells were washed with 1x Perm/Wash Buffer (BD Cytofix/Cytoperm, BD Biosciences), and 
then permeabilized with 1x Perm/Wash Buffer (BD Cytofix/Cytoperm, BD Biosciences) for 10 minutes on ice. Intracellular 
antigens were stained on ice for 30 minutes. 

For transcription factor staining, surface markers were first stained on ice for 30 minutes. Cells were then fixed with 100uL 2% 
formaldehyde on ice for 45 minutes. Cells were washed with 1x Perm/Wash Buffer (eBioscience Foxp3/Transcription Factor 
Staining Buffer Set, ThermoFisher), and then permeabilized with 1x FOXP3Perm/Wash Buffer for 10 minutes on ice. Intracellular 
antigens were stained on ice for 30 minutes. 

For AKT phosphorylation staining, surface markers were first stained on ice for 30 minutes. Cells were then fixed and washed 
following BD Phosflow kit directions. 


Instrument LSRII 


Attune NxT Flow Cytometer 


Software FlowJo 10.6.0 
Cell population abundance _ n/a 
Gating strategy All gates are shown in the supplementary information. 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
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Increased cardiac contractility during the fight-or-flight response is caused by 
B-adrenergic augmentation of Cay1.2 voltage-gated calcium channels’ *. However, this 
augmentation persists in transgenic murine hearts expressing mutant Cay1.2 0,.andB 
subunits that can no longer be phosphorylated by protein kinase A—an essential 
downstream mediator of B-adrenergic signalling—suggesting that non-channel 
factors are also required. Here we identify the mechanism by which B-adrenergic 
agonists stimulate voltage-gated calcium channels. We express 04. Or B,, Subunits 
conjugated to ascorbate peroxidase’ in mouse hearts, and use multiplexed 
quantitative proteomics®’ to track hundreds of proteins in the proximity of Cay1.2. We 
observe that the calcium-channel inhibitor Rad®’, amonomeric G protein, is enriched 
inthe Cay1.2 microenvironment but is depleted during §-adrenergic stimulation. 
Phosphorylation by protein kinase A of specific serine residues on Rad decreases its 
affinity for B subunits and relieves constitutive inhibition of Cay1.2, observed as an 
increase in channel open probability. Expression of Rad or its homologue Rem in 
HEK293T cells also imparts stimulation of Cay1.3 and Cay2.2 by protein kinase A, 
revealing an evolutionarily conserved mechanism that confers adrenergic 
modulation upon voltage-gated calcium channels. 


The positive inotropic effect of B-adrenergic agonists on the heart is 
aclassical physiological phenomenon that is universally experienced 
during excitement, exercise and the fight-or-flight response. The effect 
is mediated by B-adrenergic activation of protein kinase A (PKA), which 
inturnleads to increased Ca” influx through L-type Cay1.2 channelsin 
cardiomyocytes’ *. The commonly accepted modelis that PKA increases 
the Ca” current through Cay1.2 channels by phosphorylating Cay1.2 04. 
and/or ,, subunits (Fig. 1a). However, previously proposed putative 
regulatory residues onthe carboxyl termini of a, (Ser1928, Ser1700 and 
Thr1704)'°" and B,, (Ser512 and Ser570)” have been shown to be dis- 
pensable for B-adrenergic stimulation of Ca” currents inthe heart” ”*. 
Nevertheless, given the numerous other serine and threonine residues 
of a, and B.,, it remained possible that PKA-mediated phosphoryla- 
tion of some combination of these was responsible for B-adrenergic 
modulation of Cayl1.2 channels in cardiomyocytes. As shown below, 
this also is not the case. 


Norole for core Cay1.2 phosphorylation 


We developed a transgenic approach that enables doxycycline-induci- 
ble expression of Flag-tagged, dihydropyridine (DHP)-resistant Ca,1.2 
channels in mice (Fig. 1b)’®. The transgenic and endogenous Cay1.2 


currents are distinguishable by application of nisoldipine, a Ca*-chan- 
nel DHP antagonist’. We mutated all 51 conserved and nonconserved 
serine and threonine residues within the 35 intracellular PKA consensus 
phosphorylation sites of rabbit «,, to alanine (‘35-mutant a,,’; Extended 
Data Fig. 1a). In cardiomyocytes, the nisoldipine-insensitive 35-mutant 
Ca”* currents were both activated at more negative potentials and 
increased in response to isoproterenol (a B-adrenoreceptor agonist) 
or forskolin (which stimulates adenylyl cyclase to produce cyclic AMP, 
thereby activating PKA), to the same extent as were nisoldipine-insensi- 
tive wild-type channels (‘pseudo-wild-type (pWT)’ a, channels) (Fig. 1c, 
dand Extended Data Fig. 1b, c). 

Similarly, we mutated to alanine all 37 conserved and nonconserved 
serine and threonine residues within 28 PKA-consensus phospho- 
rylation sites of human B,, (‘28-mutant B,,’; Extended Data Fig. 1d). 
Cardiomyocytes expressing green fluorescent protein (GFP)-tagged 
28-mutant ,, (Extended Data Fig. le, f) displayed isoproterenol- or 
forskolin-induced stimulation of Cay1.2 current amplitude (Fig. le, 
g) and a hyperpolarizing shift in the voltage dependence of activa- 
tion (Extended Data Fig. 1b), similar to cardiomyocytes isolated from 
transgenic mice expressing GFP-tagged wild-type B,, (ref. ”). 

Finally, we crossed 35-mutant a, with 28-mutant B.,, transgenic 
mice. Immunoprecipitation with anti-Flag antibody indicated that 
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Fig. 1| Phosphorylation of a,, and B subunits by PKA is not required for 
B-adrenergic regulation of Cay1.2. a, Diagram showing rabbit cardiac a, (top) 
and B (bottom) subunits. Red dots indicate putative sites of phosphorylation by 
PKA. GK, guanylate kinase domain; SH3, Src homology 3 domain. 

b, Diagrams showing the binary transgene system that permits robust 
expression of Flag-tagged DHP-resistant (DHP*) a,. or GFP-tagged B,, only inthe 
presence of botha reverse tetracycline-controlled transactivator (rtTA) and 
doxycycline (Tet-ON). The top diagram shows expression of the rtTA driven by 
the cardiac-specific a-myosin heavy chain (a-MHC) promoter. The three non- 
coding exons that make up the 5-untranslated region of the a-MHC geneare 
depicted as black boxes, and the introns as lines. The lower two diagrams show 
cDNAs for Flag-DHP*-a,, or GFP-3,, ligated behind seven tandem tetO 
sequences, which impart tetracycline inducibility. c, Exemplar whole-cell Cay1.2 
currents of 35-mutant a, cardiomyocytes (from transgenic mice) in nisoldipine 


GFP-tagged 28-mutant B,, subunits dominate in the 35-mutant o,. 
complex (Extended Data Fig. 1g) in cardiomyocytes isolated from these 
mice. These mutant channels also displayed a normal isoproterenol- or 
forskolin-induced increase in peak Ca** current (Fig. 1f, g) and a hyper- 
polarizing shift in the V,, of activation (Extended Data Fig. 1b). These 
results indicate that B-adrenergic stimulation of Cay1.2 does not involve 
direct phosphorylation of a,, or B, subunits. 


Identifying the Cay1.2 subdomain proteome 


Given the foregoing results, we adapted for application to cardiomyo- 
cytes an enzyme-catalysed proximity labelling method>* in order to 
comprehensively identify components of the Cay1.2 macromolecular 
complex. We generated transgenic mice with doxycycline-inducible, 
cardiomyocyte-specific expression of DHP-resistant a, or B,, proteins 
with ascorbate peroxidase (APEX2) and a V5 epitope conjugated to 
the amino termini, enabling biotin labelling of proteins within around 
20 nm (ref. '8) of the Ca” channels. Fusing APEX2 to a, and B,, did not 
affect Cay1.2 subcellular localization and function in cardiomyocytes, 
as assessed using cellular electrophysiology, fractional shortening and 
immunofluorescence (Extended Data Fig. 2a—d). Incubating isolated 
ventricular cardiomyocytes or perfusing whole hearts with a solution 
containing biotin phenol, followed by exposure to hydrogen peroxide, 
induced robust biotinylation of proteins in a striated z-disk pattern, 
coinciding with the pattern of transgenic a,. and B,, subunits (Fig. 2a, 
b and Extended Data Fig. 2d, e). 

Western blots confirmed the biotinylation of proteins known to be 
localized near Ca,1.2 at dyadic junctions in cardiomyocytes (Fig. 2c). By 
contrast, Ky1.5 channels were not biotinylated and enriched (Fig. 2c), 
implying that these channels may not be as closely localized to Cay1.2. 
Triple-stage mass spectrometry (TMT SPS MS?) identified and quanti- 
fied hundreds of other biotinylated proteins (Supplementary Table 1), 
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before (black trace) and after (blue trace) treatment with isoproterenol. 
Representative of 25 experiments; pA pF ¢, picoamperes per picofarad. 

d, Fold change in peak DHP-resistant Ca” current at O mV caused by 
isoproterenol or forskolin. Dots show data points. Data are mean +s.e.m.; P=0.39 
by unpaired two-tailed t-test; n=45 cardiomyocytes from 5 mice (isoproterenol); 
n=25cardiomyocytes from 5 mice (forskolin). e, f, Exemplar whole-cell Cay1.2 
currents of cardiomyocytes from GFP-tagged 28-mutant B,, transgenic mice (e), 
and from 35-mutant a, crossed with 28-mutant B,, transgenic mice (f). 
Representative of 8 and 22 independent experiments respectively. 

g, Fold change in peak Ca** current caused by isoproterenol or forskolin for 
cardiomyocytes isolated from transgenic mice expressing GFP-tagged wild-type 
(WT) B,, subunit”, GFP-tagged 28-mutant B,,, or both 35-mutant a, and GFP- 
tagged 28-mutant B,,. Data are mean+s.e.m.; P=0.27 by one way-ANOVA; n=19, 
8 and 21 cardiomyocytes from 4, 4and3 mice, from left to right. 


although many probably constitute bystanders rather than physically 
interacting proteins. Of these, the 150 proteins with the highest peptide 
counts were remarkably similar in 0,--APEX2 and },,-APEX2 transgenic 
mice (Extended Data Fig. 3a). These were primarily classified as being 
membrane, cytoskeletal or sarcomeric proteins (Extended Data Fig. 3b, 
cand Supplementary Table 2). Some of these proteins, however, were 
associated with other compartments, probably reflecting the labelling 
of proteins encountered during Cay1.2-APEX2 synthesis, maturation 
and trafficking. 


Adrenergic modulation of Ca,1.2 neighbourhood 


It seemed likely that PKA-dependent stimulation of Ca* currents inthe 
heartinvolves recruitment of a distinct activator protein to, or loss ofan 
inhibitory protein from, the Cay1.2 macromolecular complex. Recently, 
APEX2 labelling, combined with either multiplexed quantitative mass 
spectrometry’ or quantitative proteomics using a system of spatial 
references and bystander ratio calculations”, was used to analyse 
ligand-induced changes in the local environment of G-protein-coupled 
receptors. Notably, we found that conjugation of peroxidase to either 
of the Cay1.2 subunits did not interfere with B-adrenergic stimulation 
of Ca** currents (Extended Data Fig. 2f, g), and that the B-adrenergic 
agonist signalling pathways were preserved in the presence of biotin 
phenol and during hydrogen-peroxide-induced labelling, assessed by 
phospholamban phosphorylation (Extended Data Fig. 2h-k). 

We preincubated isolated cardiomyocytes from a,--APEX2 and 
B2,-APEX2 mice with biotin phenol for 30 minutes; during the final 
10 minutes, the cells were also exposed to either isoproterenol or 
vehicle (Fig. 2d). We purified and quantified biotinylated proteins 
using TMT SPS MS’. The relative summed peptide TMT signal-to- 
noise ratio—indicating relative protein quantification—changed for 
several proteins (Fig. 2e, f). We also probed the effect of isoproterenol 
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Fig. 2| Changes in the Cay1.2 subdomain proteome upon B-adrenergic 
agonist activation of PKA signalling in the heart. a, Immunofluorescence of 
isolated «,.--APEX2 and B,,-APEX2 cardiomyocytes exposed to biotin-phenol 
and hydrogen peroxide. Representative of 13 and 8 cardiomyocytes from 2 and 
3 mice respectively. Staining is with anti-V5 and Alexa594-conjugated 
secondary antibodies and streptavidin-conjugated Alexa488; nuclear 
labelling is with 4’,6-diamidino-2-phenylindole (DAPI). Scale bars, 5 um. 

b, Immunofluorescence of tissue sections of Langendorff-perfused a,--APEX2 
heart. Scale bars, upper panel, 100 xm; lower panel, 5 pm; lower inset, 5 um. 
Representative of ten sections from two mice. c, Immunoblots of biotin- 
labelled proteins from cardiomyocytes of a,--APEX2 and B,,-APEX2 mice. 
Cay1.2 subunits, RyR2,Jph2 and CaM are detected, whereas Ky1.5 channels are 
not detected, in streptavidin pulldown. Blots representative of three 
independent experiments. d, Workflow for processing of isolated 
cardiomyocytes. H,O,, hydrogen peroxide; iso, isoproterenol; PLB, 
phospholamban. e, Volcano plot of fold change for relative protein 
quantification by TMT mass spectrometry of a,--APEX2 samples. Data are 
means from five pairs of biologically independent samples; non-adjusted 
unpaired two-tailed t-test. Levels of Rad in the neighbourhood of Cay1.2 (red 


on the Cay1.2 proteome signature in Langendorff-perfused whole 
hearts (Fig. 2g, h). We detected isoproterenol-induced recruitment 
of the PKA catalytic subunit (PKA,,,) to Cay1.2 channels in cardiomyo- 
cytes isolated from both a,--APEX2 and B,,-APEX2 mice (Fig. 2e, 
f). All three approaches (using isolated cardiomyocytes from o,.— 
APEX2 and B,,-APEX2 mice, and a«,--APEX2 whole hearts) indicated 
a30-50% decrease in the amount of the small Ras-like G protein Rad 
in the neighbourhood of Cay1.2 following application of isoproter- 
enol (Fig. 2e, f,h, Extended Data Figs. 4a—c, 5a, b and Supplementary 
Tables 3-5). By analysing the overlap of proteins displaying isopro- 
terenol-induced changes in a,.-APEX2 and B,,-APEX2 experiments, 
we found Rad to be the only candidate protein displaying this behav- 
iour (Extended Data Fig. 5b, c). By contrast, a10-minute exposure of 
cardiomyocytes isolated from nontransgenic mice to isoproterenol 
(Fig. 2i) had a minimal effect on proteins relatively quantified by 
TMT SPS MS?, and specifically no substantial effect on the amount 
of Rad by comparison with untreated paired cardiomyocytes (Fig. 2j, 
Extended Data Fig. 4d and Supplementary Table 6). In this control 
experiment, proteins were not proximity labelled. Thus, §-adrenergic 


dot) are reduced by 50%, and levels of the PKA catalytic subunit (green dot) are 
increased by 50%, upon application of isoproterenol. f, As ine, except for 
cardiomyocytes from B,,-APEX2 mice. Data are means from three pairs of 
biologically independent samples. Levels of Rad are reduced by 30% and of the 
PKA catalytic subunit are increased by 68% in the neighbourhood of Cay1.2 
following application of isoproterenol. g, Workflow showing protein labelling 
and analysis of Langendorff-perfused a,--APEX2 mouse hearts. Bpm, beats per 
minute. h, Asine, but for proteins from a,--APEX2 whole-heart samples. Data 
are means for five hearts: five without and five with isoproterenol. Levels of 
Radin the neighbourhood of Cay1.2 are reduced by 36% following application of 
isoproterenol. i, Workflow for analysis of isolated cardiomyocytes from non- 
transgenic (NTG) mice.j, As ine, but for proteins isolated from NTG 
cardiomyocytes without biotinylation or pull-down. The asterisk indicates the 
single peptide ESFDSQSLINNQSK, mapping to Uniprot D3ZON2 (Sorbs2), with 
its apparent abundance reduced by isoproterenol treatment in non-transgenic 
cardiomyocytes to about 27%, probably because of post-translational 
modification. Data are means for four pairs of biologically independent 
samples. Levels of Rad in whole heart are unchanged by isoproterenol. For 
source gel data, see Supplementary Fig. 1. 


stimulation depletes Rad from the neighbourhood of Cay1.2, but not 
from the cell as a whole. 

Rad is amember of the Rad, Rem, Rem2 and Gem/Kir (RGK) family 
of Ras-like GTP-binding proteins—known for their capacity to inhibit 
all high-voltage-activated Ca”* channels®’—and is a potential PKA tar- 
get”. Rad-knockout mice display an increased basal Ca”* current with 
activation at lower voltages and reduced B-adrenergic stimulation of 
their Ca** channels”. Other studies, however, led to expectations that 
Radis not directly involved in adrenergic regulation of Cay1.2, because 
overexpression of Rad” or Rem” in cultured cardiomyocytes markedly 
attenuated the basal Cay1.2 current, which was also not increased by 
B-adrenergic stimulation. Nevertheless, our proximity labelling results 
elevated Rad as the leading candidate for the critical missing link that 
enables PKA regulation of Cay1.2. 


PKA regulates Ca,y1.2 via Rad phosphorylation 


The robust heterologous reconstitution of PKA regulation of Cay1.2 
currents has long been pursued as a crucial step in identifying and 
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Fig. 3 | Phosphorylation of Rad is required for cAMP-PKA-mediated 
activation of Ca,1.2.a,c, e, f, Ba** current elicited by voltage ramp every 10s, 
with black traces obtained before and blue traces obtained after forskolin 
treatment. Representative of 15, 16, 8 and 13 cells, from left to right. b, d, Diary 
plot of normalized Ba** current amplitude at 0 mV. Representative of 15 and 16 
cells. g, Fold change in maximum conductance (G,,,,) induced by forskolin. 
Data are mean +s.e.m.; P< 0.0001 by one-way ANOVA; **P< 0.01, ****P< 0.0001 
by Tukey’s test. n=27, 76, 9,23 and 18 cells, from left to right. Specific Pvalues 
can be found in the associated Source Data (see Supplementary Information). 
h, Boltzmann function parameter V;). Data are mean +s.e.m.;***P< 0.001 by 
two-tailed paired t-test. n=15, 16, 8 and 13, from left to right. i, Fold change in 
Giax induced by forskolin in the absence and presence of Rad. Dataare 

mean +s.e.m.;****P< 0.0001 by unpaired two-tailed t-test. n=7 and 16, from left 
toright.j-I, The top rows display stochastic records, where channel closures 
are zero-current portions of the trace (horizontal grey lines) and openings are 
downward deflections to the open level (slanted grey lines). Inthe bottom row, 
pale blue and grey lines are average P)-V relationships from multiple cells. Blue 
and black lines are Boltzmann fits. In all experiments, a,.and B,, were 
expressed in HEK cells with no Rad (j), with wild-type Rad (k), or with 4 SA- 
mutant Rad (I), inthe absence or presence of exogenous PKA catalytic subunit. 
The upper dashed lines show maximal P, for control of a,. plus B,, without Rad. 
Scale bars, 1 pA and 25 ms. Control, n=10; control plus PKA, n=5; Rad,n=5; Rad 
plus PKA, n=8;4SA mutant Rad, n=6,4SA mutant Rad plus PKA, n=5. 


validating the mechanism. We now find that Rad was the missing ingre- 
dient. To prove this experimentally, we coexpressed Rad with a,, and 
B., Subunits in HEK293T cells, limiting Rad expression by using a1/3 to 
1/6 ratio of Rad/Cay1.2 complementary DNA to avoid complete inhibi- 
tion of the Ca** current. We used a perforated, whole-cell patch clamp 
technique to preserve the normal intracellular milieu and signalling 
cascades, and to minimize current run-down. Ba” was used as a charge 
carrier to eliminate Ca”*-dependent inactivation. In HEK293T cells 
transfected with only a,, plus B,,, superfusion of forskolin over 1-3 
minutes had no impact ona Ba” current (Fig. 3a, b, g, h). By contrast, 
applying forskolin to cells expressing 0,¢, B., and Rad increased the 
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maximal conductance (G,,,,) by as muchas 4.5-fold and by amean of 1.5- 
fold, and shifted the V;, for activation (Fig. 3c, d, g, hand Extended Data 
Fig. 6a, b). The forskolin-induced increase in current was inversely pro- 
portional to the basal current density, as observed in cardiomyocytes 
(Extended Data Figs. 1c, 6c). In HEK293T cells expressing 35-mutant 
Qc plus 28-mutant ,, plus Rad, applying forskolin increased G,,,, by 
as muchas 3.1-fold and by a mean of 1.9-fold, and caused a hyperpolar- 
izing shift in the V;, for activation (Fig. 3i and Extended Data Fig. 6d, 
e). For both wild-type and phosphorylation-site-mutant a,. and B5, 
subunits, the forskolin-induced enhancement of Ba” current in Rad- 
transfected cells was greatest at hyperpolarized potentials and fell as 
the test potential approached the reversal potential (Extended Data 
Fig. 6f, g), consistent with observations in cardiomyocytes”. 

We used single-channel recordings to determine the mechanism of 
PKA/Rad modulation of Cay1.2. In the absence of Rad, sweeps with no 
openings or blank sweeps are rare, while most sweeps exhibit either 
intermediate or high levels of openings (Fig. 3j and Extended Data 
Fig. 6h, k). In HEK293T cells transfected with a, plus B,,, coexpres- 
sion of PKA,,, Subunit has no effect on open probability (P,) (Fig. 3j). 
When Rad is expressed, the fraction of blank sweeps is increased, 
the low-activity mode predominates, and the P, is reduced (Fig. 3k 
and Extended Data Fig. 6i). By comparison, if the PKA,,, Subunit is 
coexpressed with Rad, the fraction of blank sweeps is reduced, the 
high-activity mode resurges, and the P, is increased by 10.6 + 2.9-fold 
compared with transfection without PKA (Fig. 3k and Extended Data 
Fig. 6j, 1). These results suggest that Rad potently dampens the Cay1.2 
current, while phosphorylation of Rad allows channels to operate as 
though they were devoid of Rad. 

We identified 14 consensus PKA phosphorylation sites in Rad 
(Extended Data Fig. 7a), and we mutated these sites to alanine. The 
mutant Rad effectively inhibited Cay1.2 currents; however, the cAMP/ 
PKA-mediated upregulation of Cay1.2 current was lost (Fig. 3g). In 
lysates from forskolin-stimulated HEK293T cells transfected with GFP- 
Rad, we observed phosphorylation of Ser25, Ser38 and Ser300 by mass 
spectrometry (Extended Data Fig. 7b). These residues were previously 
identified in mouse hearts as phosphorylation targets” (Extended 
Data Fig. 7c). We were unable to detect either unphosphorylated or 
phosphorylated peptides containing Ser272, notwithstanding prior 
biochemical studies identifying this residue as a PKA target”’. Alanine 
substitutions of Ser25, Ser38, Ser272 and Ser300 in Rad (4-SA mutant; 
Extended Data Fig. 7d) prevented the forskolin-induced increase in 
Gmax and hyperpolarizing shift in the current-voltage curve (Fig. 3e, 
g, h). By contrast with transfection with wild-type Rad, cotransfec- 
tion of PKA with 4-SA mutant Rad failed to increase Pp (PKAto no PKA, 
1.15 + 0.56-fold; Fig. 31). 

AC-terminal polybasic region of 32 amino acids in Rad is involved in 
plasma-membrane targeting via binding to negatively charged phos- 
pholipids suchas phosphatidylinositol-4,5-bisphosphate (PtdIns(4,5) 
P,, also known as PIP,)””. Deletion of the C terminus of RGK GTPases 
prevents their inhibition of Ca”*-channel function’. We found that 
alanine substitutions at Ser272 and Ser300 (2-SA mutant) within the 
C-terminal polybasic membrane region prevented both the forskolin- 
induced increase in current amplitude and the hyperpolarizing shift 
inthe current-voltage curve (Fig. 3f, h and Extended Data Fig. 6c). 


Role of Rad binding to Ca,1.2 B subunits 

Rad can inhibit Cay1.2 via B-dependent and B-independent (a,-- 
dependent) mechanisms”. Substituting Rad residues Arg208 and 
Leu235, or Bo, residues Asp244, Asp320 and Asp322, with alanine 
(Extended Data Fig. 8a, b) attenuates Rad binding to B subunits”*”’. 
These mutations prevented the forskolin-induced increase in G,,,, and 
the hyperpolarizing shift in the current-voltage curve (Fig. 4a and 
Extended Data Fig. 8c, d). Thus, the interaction between Rad and B 
subunits is essential for cAMP-PKA regulation of Cay1.2. 
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Fig. 4| RGK GTPases confer adrenergic regulation to Cay1.2, Cay1.3 and 
Ca,2.2 channels via binding to B. a, Fold change inG,,,,, for different 
combinations of wild-type and mutant Rad and B,, subunit. Data are 
mean+s.e.m.;P< 0.0001 by one-way ANOVA; ***P< 0.001 by Dunnett’s test. 
The data for WT Rad and WT &,, areas in Fig. 3g.n=76,20and 15, left to right. 
Specific Pvalues can be found in the associated Source Data (see 
Supplementary Information). b,c, At the left are shown schematics of the FRET 
pairs, Cer-B,, with Ven-WT-Rad (b) or Ven-4SA-mutant Rad (c) (where Ven is 
the venus tag and Cer is the cerulean tag). At the right, FRET efficiency (£,) is 
plotted against the free concentration of Ven-WT (b) or Ven-4SA-mutant Rad 
(c). The solid line fits a1/1 binding isotherm. d, Ba** current of Cay1.3 channels, 
without or with expression of Rad, elicited by voltage ramp every 10s. Black 
traces indicate before and blue traces after forskolin treatment. No Rad, Rad: 
seven cells each. e, Fold change inG,,,,. Data are mean +s.e.m.; ****P< 0.0001 by 
unpaired two-tailed ¢-test;n=7 in both cases. f, Boltzmann function parameter 
V9. Data are mean +s.e.m.;*P< 0.05; **P< 0.01 by paired two-tailed t-test; n=7 
inall cases. g, Ba* current of Cay2.2 channels, without or with expression of Rad 
or Rem, elicited by voltage ramp every 10s. Black traces were obtained before, 
and blue traces after, treatment with forskolin. Representative of 11, lland15 
cells, from top to bottom. h, Fold change inG,,,,,. Data are mean +s.e.m. 
P<0.001 by one-way ANOVA; ***P< 0.001, *P< 0.05 by Dunnett’s test. n=11,11 
and15, fromleft to right. i, Boltzmann function parameter V;,. Dataare 

mean +s.e.m.;****P< 0.001 by paired two-tailed t-test; n=11, 11and15, from left 
to right.j, k, Proposed model of B-adrenergic regulation of Ca” channels. 

j_k, Basal state (j), and after B-agonist-induced activation of adenylyl cyclase 
(AC) (k) leads to activation of PKA and hence to PKA-mediated phosphorylation 
of Rad, causing dissociation of Rad from the Cay1.2 complex and therefore 
increased Ca” influx. 


We used a flow-cytometry Forster resonance energy transfer (FRET) 
two-hybrid assay” to probe potential PKA-mediated changes in the 
binding of B,, subunits and wild-type Rad. At baseline, we detected 
robust binding between cerulean-tagged B,, subunit and venus-tagged 
wild-type Rad, consistent with previous studies” (with an effective 
dissociation constant (K,¢,;) of 7,957 + 418; Fig. 4b and Extended Data 
Fig. 9c). Coexpression of the PKA,,, subunit, however, markedly weak- 
ened this interaction (Kj ,--=145,231+3,084). By contrast, coexpression 
of the PKA,,, subunit in cells expressing fluorophore-tagged B,, and 4-SA 
mutant Rad had no effect on FRET binding (Kj¢-; = 4,349 + 138 versus 
Kaerr = 4,346 + 197 with and without, respectively, the PKA catalytic 
domain; Fig. 4c and Extended Data Fig. 9c). These results suggest that 
phosphorylation of Rad is required for dissociation of the Rad-3,, inter- 
action. Ina similar manner, PKA phosphorylation of Rad also reduced 
FRET binding to both B, and B, (Extended Data Fig. 9a-c). 


Unified mechanism for PKA regulation 


Inadrenal chromaffin cells and the sinus node cells of the heart, L-type 
Ca,1.3 channels are robustly stimulated by PKA®*’. We found that, in 
HEK293T cells transfected with only Cay1.3 a,p plus B,,, superfusion of 
forskolin over 1-3 minutes had no impact on the Ba” current (Fig. 4d-f). 
By contrast, in cells expressing ap, B., and Rad, applying forskolin 
increased G,,,,, by as much as 2.3-fold and by a mean of 1.9-fold, and 
shifted the V,,,. for activation (Fig. 4d-f). We also expressed the N-type 
Cay2.2 0, subunit, which is widely expressed in neurons, with B,, and 
Rad in HEK293T cells. Forskolin increased G,,,, through Cay2.2 when 
coexpressed with Rad bya mean of 2.2-fold and shifted the V;, for activa- 
tion (Fig. 4g-i). For both Cay1.3 and Ca,2.2, attenuating binding of Rad 
to Bsubunits prevented the forskolin-induced modulation of Cay2.2 cur- 
rent (Extended Data Fig. 8e, f). We also expressed in HEK293T cells the 
Cay2.2 &,, subunit with B,, and Rem, another member of the RGK GTPase 
family. Forskolin increased G,,,, through Cay2.2 when coexpressed with 
Rem by1.6-fold and shifted the V;,. for activation (Fig. 4g-i). Thus, PKA- 
mediated modulation of Cay channels is not idiosyncratic, as currently 
believed; rather, it is emerging to bea universal mechanism transferable 
to all Cay channels that bind B subunits. 


Discussion 


The core a,, and B,, subunits, previously hypothesized to contain the 
PKA target sites required for B-adrenergic agonist-induced stimulation 
of Cay1.2, do not. Rather, successful reconstitution of regulation ina 
heterologous expression system requires an additional component, 
which we now identify as Rad. The cCAMP-PKA-mediated regulation 
of Cay1.2 requires both phosphorylation by PKA on the C terminus of 
Rad and the interaction of Rad with the B subunit. Multiple-alignment 
analysis of Rad from mice and other species shows that the four phos- 
phorylation sites are conserved (Extended Data Fig. 10a). The required 
interaction with the B subunit is consistent with our recent finding that 
disrupting the a,.-f interaction prevents the regulation of Cay1.2 by 
PKA inthe heart”. 

Analysis of Rad and other members of the RGK GTPase family indi- 
cates that their C-terminal phosphorylation sites are highly similar 
(Extended Data Fig. 10b). Short stretches of basic and hydrophobic 
amino acids are known to interact with the membrane” and phospho- 
rylation of residues within these stretches alters their electrostatic 
character, thereby reducing membrane affinity**. We found that phos- 
phorylation of two serine residues within the C-terminal polybasic 
region of Rad releases Ca”* channels from Rad-mediated inhibition, 
probably by means of reducing the affinity of Rad with the membrane 
and with the Ca, B subunit (Fig. 4j, k). This mechanism of regulation is 
modular and transferable, as Cay1.3 channels and neuronal Cay2.2 chan- 
nels are also imparted with forskolin/PKA-mediated upregulation via 
Rad and Rem. The activation of Ca”* channels via release of inhibition 
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by PKA phosphorylation is reminiscent of PKA phosphorylation of 
phospholamban, an inhibitor of the sarco/endoplasmic reticulum 
Ca**-ATPase*. 

Whereas phosphorylation of Ser1928 is definitively not required 
for B-adrenergic regulation of Cay1.2 in the heart, substitution of this 
residue with alanine prevents B-adrenergic stimulation of Ca” chan- 
nels in hippocampal neurons and hyperglycaemia-induced stimulation 
of Ca** currents in arterial smooth muscle cells***”. Perhaps there are 
tissue-specific differences in Cay1.2 regulation. Phosphorylation of the 
Q,- subunit may also affect the trafficking and clustering of channels 
in neurons and cardiomyocytes”. 

Our results demonstrate that proximity labelling using APEX2 is 
feasible in the heart and, combined with multiplexed TMT proteomics, 
canidentify a dynamically evolving network of interactions induced by 
B-adrenergic stimulation. This study establishes the utility of proximity 
labelling in animals and provides an important foundation for future 
studies that will investigate how diseases, suchas heart failure, change 
the proteomic subdomain of the excitation-contraction coupling 
machinery. 

Augmented Ca” entry enhances the opening of ryanodine recep- 
tors via Ca?*-induced Ca” release and increases cardiac contractility. 
Dysregulation of Cay1.2 activity can result in cardiac arrhythmias, heart 
failure and sudden death. Supporting an important role of RGK GTPases 
in humans, aRem2 variant has been identified as a genetic modifier in 
long QT syndrome 2 (ref. *°), and a Rad variant was recently linked to 
Brugada syndrome”. Our results identify potential targets and interac- 
tion sites for the therapeutic modulation of B-adrenergic regulation of 
Ca** currents in the heart and other tissues. For instance, disrupting the 
interaction between Rad and B subunits can be inotropic by increasing 
Ca** entry in the heart. Conversely, interfering with the a,,—f interac- 
tion”, blocking PKA phosphorylation of Rad or potentially enhancing 
the interaction of Rad with the plasma membrane could attenuate 
the sympathetic nervous system activation of cardiac Ca entry and 
inotropy more specifically than do B-blockers. 
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Methods 


Clone construction and cell culture 

All mouse N-terminally GFP-tagged Rad (GenBank accession number 
XM_006531206) constructsin a pEGFP-Cl vector were generated by gene 
synthesis (Gene Universal). Human Cay2.2 (pSAD442-1) was a gift from 
D. Lipscombe (Addgene plasmid 62,574; http://n2t.net/addgene:62574; 
Resource Identification (RRID) number Addgene_62574). AllcDNA 
clones were authenticated by sequencing. 

HEK293T cells (American Type Culture Collection (ATCC), CRL- 
3216) were cultured in Dulbecco’s modified Eagle’s medium (DMEM) 
with 10% fetal bovine serum (FBS) and 1% penicillin/streptomycin, 
and were transfected with 3 pg rabbit a,, (accession number X15539), 
human aq,, (for Cay2.2 experiments) or rat &,p (for Cay1.3 experiments, 
accession number AF30700039), 3 pg human B,, (NM-201590.3) and 
0.5 pg N-terminal-GFP-tagged mouse Rad using Lipofectamine 2000 
(Thermo Fisher Scientific). For the FRET experiments, rat B, (acces- 
sion number NM_012828) and rat B, (accession number LO2315) were 
used. The media was changed 4-6 h after transfection. The cells were 
split onto coverslips coated with attachment factor protein (Gibco). 
Electrophysiological recordings were carried out at room temperature 
24-48 h after transfection. The cells were authenticated by ATCC and 
tested for mycoplasma contamination. 


Generation of transgenic mice 

The Institutional Animal Care and Use Committee at Columbia Uni- 
versity approved all animal experiments. We used male and female 
mixed-strain mice at 6 weeks to 4 months of age. Sample sizes exceeded 
the number of samples determined by power calculations, which were 
based on effect sizes reported previously’*””*?*, The number of mice 
was always greater than three per genotype. The investigators were 
blinded to group allocation during data acquisition and analysis. 

Transgenic constructs were generated by fusing rabbit a,. cDNA or 
human B,, cDNA tothe modified murine a-myosin heavy chain (MHC) 
tetracycline-inducible-promoter vector (a gift from J. Robbins and 
J. Molkentin, University of Cincinnati, Cincinnati, OH)***. A 3x Flag 
epitope was ligated in-frame to the N terminus of a,,. The 0, subu- 
nit was engineered to be dihydropyridine (DHP)-insensitive with the 
substitutions T1O66Y and Q1070M (refs. *°*’). GFP was ligated to the 
N terminus of B,,. The creation of GFP-WT-B,, transgenic mice was 
described previously”. The V5 epitope and APEX2 cDNA**®, created 
by gene synthesis, were conjugated to the N terminus of DHP-resistant 
Q,. and WT £,,. 

The 35-a mutant and the 28-8 mutant cDNA were generated by site- 
directed mutagenesis. The optimal PKA phosphorylation motif is a 
tetrapeptide with arginine at the second and third positions (termed 
-2 and -3) before the phosphorylated serine or threonine, and a large 
hydrophobic residue immediately thereafter (R-R-X-S/T-®)**. The 
requirement for a positive charge is highest for residues at —2 and 
-3, but can be found for residues as far as position —6 in PKA target 
sites”. Sites with arginine in positions between —4 and -1arestrongly 
preferred, and histidine or lysine to a lesser extent°”°*. We identified 
all potential intracellular PKA phosphorylation sites (Extended Data 
Fig. 1a, d) in rabbit a,, and human B,, using both manual sequence 
analysis and several web-based PKA phosphorylation prediction tools, 
including pkaPS*, DISPHOS*, GPS*°, NETPHOS~* and SCANSITE°’. 
Each predicted serine or threonine was mutated to alanine. We also 
mutated additional serines and threonines within several amino- 
acid residues C-terminal to the arginine or lysine, in order to ensure 
that we fully captured each phospho-regulatory site. We replaced 51 
residues in rabbit o,. with alanine at 35 potential phospho-regulatory 
domains in the 35-mutant o,, construct, and replaced 37 residues 
with alanine at 28 putative phospho-regulatory domains of B,,. We 
excluded those sites that were predicted to be extracellular or within 
the plasma membrane. 


Transgenic mice with non-targeted insertion of these tetracycline- 
regulated cDNAs (Fig. 1b) were bred with cardiac-specific (a-MHC), 
doxycycline-regulated, codon-optimized reverse transcriptional 
transactivator (rtTA) mice (obtained via the Mutant Mouse Resource 
and Research Center) to generate double-transgenic mice. For the 
O,--APEX2 and B,,-APEX2 mice, transgene expression did not require 
doxycycline owing to a low basal binding of rtTA protein to the Tet 
operator sequences (so called ‘leak’)°’. These expression levels result 
in Ca** current levels similar to those found under native conditions in 
the heart. The results presented here were consistent across all founder 
lines and gender, and therefore were pooled. 


Isolation of adult cardiac myocytes 

Mice ventricular myocytes were isolated by enzymatic digestion using 
a Langendorff perfusion apparatus as described’*"”?°, Cardiomyo- 
cytes were isolated from 8-12-week-old non-transgenic and transgenic 
mice. Only non-contracting rod-shaped cells with clear striations were 
used for functional studies (electrophysiology and excitation-contrac- 
tion coupling). 


Proximity labelling biotinylation 

Proximity labelling was performed as described** with minor modifica- 
tions. Isolated ventricular cardiomyocytes were incubated in labelling 
solution with 0.5 mM biotin-phenol (Iris-biotech) for 30 min. For some 
experiments, during the final 10 min of incubation, 1 pM isoproterenol 
(Sigma 15627) was added. To initiate labelling, H,O, (Sigma H1009) was 
added to a final concentration of 1mM for 1 min. Exactly 1 min after 
H,O, treatment, the cells were washed three times with cold quench- 
ing solution containing 10 mM sodium ascorbate (VWR 95035-692), 
5mM Trolox (Sigma 238813) and 10 mM sodium azide (Sigma S2002). 
After cells were harvested by centrifugation, the quenching solution 
was aspirated and the pellet was flash-frozen and stored at -80 °C until 
streptavidin pull-down. 

For biotinylation in Langendorff-perfused hearts, mice were injected 
with5 mg kg™ propranolol (Sigma PHR1308) in order to suppress adr- 
energic stimulation during isoflurane anaesthesia and cardiectomy. 
Hearts were retrograde perfused with Krebs solution for 10 min before 
addition of biotin-phenol for 15 min. During the final 5 min, 1 1M iso- 
proterenol or vehicle was added to the perfusate. Electrocardiograms 
were monitored throughout the experiment to ensure viability of the 
preparation and an isoproterenol-induced increase in heart rate. 

The cells or whole heart tissue were lysed with a hand-held tip homog- 
enizer in a solution containing SO mM Tris (tris(hydroxymethyl)ami- 
nomethane), 150 mM NaCl, 10 mM EGTA, 10 mM EDTA, 1% Triton X-100 
(v/v), 0.1% SDS (w/v), 10 mM sodium ascorbate, 5 mM Trolox and10 mM 
sodium azide, phosphatase inhibitors (Sigma 4906845001), protease 
inhibitors (Sigma 4693159001), calpain inhibitor I (Sigma A6185) and 
calpain inhibitor II (Sigma A6060). Biotin labelling of the samples was 
confirmed after size fractionation of proteins on SDS-polyacrylamide 
gel electrophoresis (PAGE), transfer to nitrocellulose membranes, and 
probing with streptavidin-conjugated horseradish peroxidase (HRP) 
(Thermofisher, $911, lot number 1711896, 0.6 mg mI). The response to 
isoproterenol was assessed by immunoblotting with an anti-phospho- 
phospholamban (Ser16/Thr17) antibody (Cell Signaling, number 8496, 
lot number 1; 1/1,000 dilution). 

Biotinylated proteins were bound to streptavidin magnetic beads 
(Thermo Fisher Scientific 88817), and the beads were washed three 
times witha solution containing 4 M urea, 0.5% SDS (w/v) and 100 mM 
sodium phosphate pH 8. Proteins were size-separated on SDS-PAGE, 
transferred to nitrocellulose membranes, and probed with anti-V5 
antibody (Thermofisher, R960-25; 1/5,000 dilution), a custom-made 
polyclonal anti-o,, antibody (Yenzym, 1/1,000 dilution)”, a custom- 
made polyclonal anti-§-antibody (epitope: mouse residues 120-138, 
DSYTSRPSDSDVSLEEDRE; Yenzym; 1/1,000 dilution), an anti-JPH2 
antibody (Pierce, PAS-20642, lot number NG1583142; 1/1,000 dilution), 
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an anti-calmodulin antibody (Millipore Sigma 05-173; 1/1,000 dilu- 
tion), a custom-made anti-RyR2 antibody (1/5,000 dilution)®, and an 
anti-K,1.5 antibody (Alomone, APC-150, lot number APCOO4AN0850; 
1/1,000 dilution). 


Immunoprecipitations 

Cardiomyocytes were lysed with a hand-held tip homogenizer in a1% 
(v/v) Triton X-100 buffer containing SO mM Tris-HCl (pH 7.4) 150 mM 
NaCl, 10 mM EDTA, 10 mM EGTA and 0.01 mM calpain inhibitor I, 0.01 
mM calpain inhibitor II, and complete protease inhibitors (1 per 7 ml, 
Roche). The lysates were incubated on ice for 30 min and centrifuged 
at 14,000 r.p.m at 4 °C for 10 min; supernatants were then collected. 
Anti-Flag antibody (Sigma, F7425, lot number 078M4886V) immuno- 
precipitations were performed, as described”, overnight in alysis buffer 
consisting of 50 mM Tris-HCl pH 7.4, 150 mM NaCl, 0.25%-1% Triton 
X-100 (v/v), 10 mM EDTA, 10 mM EGTA, 0.01 mM calpain inhibitor I, 
0.01 mM calpain inhibitor Il, and complete protease inhibitors (1 per 
7 ml). Antibody-protein complexes were collected using protein-A- 
conjugated agarose (Amersham) for 2 h, followed by three washes in 
lysis buffer. Proteins were size-separated by SDS, transferred to nitrocel- 
lulose membranes and probed with HRP-conjugated anti-Flag antibody 
(Sigma, A8592), acustom-made anti-B-antibody and HRP-conjugated 
secondary goat anti-rabbit antibody. Detection of luminescence was 
performed with a charge-coupled-device (CCD) camera (Carestream 
Imaging). The uncut gels are shown in Supplementary Fig. 1. 


Immunofluorescence 

Isolated cardiomyocytes were first exposed to biotin-phenol and H,O, 
as described above. After quenching, the cells were fixed for 15 min 
in 4% paraformaldehyde, washed with glycine/phosphate-buffered 
saline (PBS) twice, treated with 0.1% Triton X-100 (v/v) in PBS (PBST) 
for 5 min, and blocked with 3% bovine serum albumin (BSA; w/v) in PBS 
for 1h. Indirect immunofluorescence was performed using 1/500 dilu- 
tions of anti-V5 antibody (Thermo Fisher, R960-25), 1/200 dilutions of 
Alexa594-labelled goat anti-mouse antibody (Thermo Fisher, A11032, 
lot 2069816) and 1/800 dilutions of streptavidin—Alexa Fluor 488 con- 
jugate (Thermo Fisher, $32354, lot 1719656). Images were acquired 
using a confocal microscope. 


Processing biotinylated proteins for mass spectrometry 
Proteins were prepared as described’ with some modifications. Pro- 
teins were precipitated with trichloroacetic acid (TCA; Sigma T9159) 
and then centrifuged at 21,130g at 4 °C for 10 min. The pellet was washed 
with -20 °C cold acetone (Sigma 650501), vortexed, and centrifuged 
at 21,130g at 4 °C for 10 min. Following centrifugation, acetone was 
aspirated and the pellet was washed with acetone three more times. 
After the last washing step, the pellet was resuspended in 8 M urea, 
100 mM sodium phosphate pH 8, 100 mM ammonium bicarbonate 
and 1% SDS (w/v), and then rotated at room temperature until fully 
dissolved. Resuspended proteins were centrifuged at 21,130g at room 
temperature for 10 min and the cleared supernatant was transferred to 
anew microcentrifuge tube. To reduce disulphides, we added 10 mM 
Tris (2-carboxyethyl)phosphine hydrochloride (TCEP-HCI; Thermo 
Fisher Scientific PG82089) in Milli-Q water titrated to pH 7.5 with 
sodium hydroxide. To alkylate free cysteine, we added freshly pre- 
pared 400 mM iodoacetamide (Thermo Fisher Scientific 90034) stock 
solution inSO mM ammonium bicarbonate to the supernatant to a final 
concentration of 20 mM, then immediately vortexed and incubated 
the solution in the dark for 25 min at room temperature. After alkyla- 
tion, freshly prepared dithiothreitol (DTT) stock solution was added 
toa final concentration of 50 mM in order to quench alkylation. Water 
was added to each sample to reach a final concentration of 4 M urea 
and 0.5% (w/v) SDS. 

A100 pl suspension equivalent per sample of streptavidin mag- 
netic beads was washed twice with a solution containing 4 M urea, 


0.5% SDS (w/v) and 100 mM sodium phosphate pH 8, and was added 
to each 1 mg of protein sample, diluting each sample with an equal 
amount of water to reach a final concentration of 2 M urea, 0.25% SDS 
(w/v), 50 mM sodium phosphate pH 8 during pull-down. The tubes 
were rotated overnight at 4 °C. Following streptavidin pull-down, the 
magnetic beads were washed three times with a solution containing 
4 Murea, 0.5% SDS (w/v), 100 mM sodium phosphate pH 8, and three 
times with the same buffer without SDS. The beads were transferred to 
new tubes for the last wash step. Before final pulldown of the beads for 
mass-spectrometry analysis, streptavidin-HRP blotting was performed 
on 5% of the resuspended beads. 


On-bead digestion and TMT labelling 

Samples were prepared as described’. Liquid reagents used were of high- 
performance liquid chromatography (HPLC) quality grade. Washed 
beads were resuspended in 50 pl of 200 mM 3-[4-(2-hydroxyethyl)- 
piperazin-1-yl]propane-1-sulfonic acid (EPPS) buffer pH 8.5, 2% 
acetonitrile (v/v) with 1 pl of LysC stock solution (2 mg ml, Wako), 
vortexed briefly and incubated at 37 °C for 3 h. Then, 50 pl of trypsin 
stock (Promega V5111) diluted 1/100 (v/v) in200 mM EPPS pH 8.5 was 
added. After mixing, digests were incubated at 37 °C overnight and 
beads were magnetically removed. Peptides were directly labelled 
after digest. For this, acetonitrile was added to aconcentration of 30% 
(v/v) and peptides were labelled with TMT 10-plex or 11-plex reagents 
(Thermo Fisher Scientific 90406 and A34807) for 1h. Reactions were 
quenched with hydroxylamine ata final concentration of 0.3% (v/v) for 
15 min, and 1% of labelled peptides were analysed for efficiency of label 
incorporation and relative ratios by mass spectrometry. After quench- 
ing, peptide solutions were acidified with formic acid, trifluoroacetic 
acid (TFA) was added to a concentration of 0.1%, and peptides were 
desalted and fractionated by high pH reversed phase chromatography 
(Thermo Fisher Scientific 84868). After loading of labelled peptides 
onto preconditioned columns and a single wash with water, excess 
unincorporated TMT label was removed by washing reversed phase 
columns once with 0.1% trimethylamine (TEA) buffer containing 5% 
acetonitrile. Samples were fractionated under alkaline conditions into 
12 fractions with increasing concentrations of acetonitrile: 10%, 12.5%, 
15%, 17.5%, 20%, 25%, 30%, 35%, 40%, 50%, 65% and 80%. Fractions 1 
and 7, 2 and 8, 3 and 9, 4 and 10, 5 and 11, and 6 and 12 were pooled to 
obtain 6 final pooled fractions for subsequent analysis. Pooled frac- 
tions were dried to completion and further purified and desalted by 
acidic C,, solid phase extraction (StageTip). Labelled peptides were 
finally resuspended in 1% formic acid (v/v) and 3% acetonitrile (v/v). 


Whole-cell proteomics 

Isolated nontransgenic cardiomyocytes were lysed by homogenization 
(QIAshredder cartridges, Qiagen) in lysis buffer (2% SDS, 150 mM NaCl, 
50 mM Tris pH 7.4). Lysates were reduced with5 mM DTT and alkylated 
with 15 mM iodoacetamide for 30 min inthe dark; alkylation reactions 
were quenched with freshly prepared DTT added to aconcentration 
of 50 mM, and proteins were precipitated by methanol/chloroform 
precipitation. Digests were carried out in 1M urea freshly prepared 
in200 mM EPPS pH 8.5 in the presence of 2% acetonitrile (v/v) with 
LysC (Wako, 2 mg mI, 1/75 (w/w) protease/substrates during digest) 
for 3h at room temperature and after subsequent addition of trypsin 
(Promega V5111, stock 1/100 w/w protease/substrates) overnight at 
37 °C. The missed cleavage rate was assayed from a small aliquot by 
mass spectrometry. For whole-proteome analysis, digests contain- 
ing approximately 60 pg of peptide material were directly labelled 
with TMT reagents (Thermo Fisher Scientific). Labelling efficiency 
and TMT ratios were assayed by mass spectrometry, while labelling 
reactions were stored at —80 °C. After quenching of TMT labelling reac- 
tions with hydroxylamine, TMT labelling reactions were mixed, solvent 
evaporated to near completion, and TMT-labelled peptides purified 
and desalted by acidic reversed phase C,, chromatography. Peptides 


were then fractionated by alkaline reversed phase chromatography 
into 96 fractions and combined into 24 samples. 


Mass-spectrometry analysis 

Data collection followed a MultiNotch MS? TMT method® using an 
Orbitrap Lumos mass spectrometer coupled to a Proxeon EASY-nLC 
1200 liquid chromatography system (both Thermo Fisher Scientific). 
The capillary column used was packed with C,, resin (length 35 cm, inner 
diameter 75 tm, matrix 2.6 pm Accucore (Thermo Fisher Scientific). 
Peptides of each fraction were separated for 4 h over acidic acetoni- 
trile gradients by liquid chromatography before mass-spectrometry 
analysis. The scan sequence started with an MS' scan (Orbitrap analysis; 
resolution 120,000; mass range 400-1,400 Th). MS? analysis followed 
collision-induced dissociation (CID; CID energy =35) with amaximum 
ion-injection time of 150-300 ms and an isolation window of 0.4 m/z. 
In order to obtain quantitative information, we fragmented MS? pre- 
cursors by high-energy collision-induced dissociation (HCD) and ana- 
lysed the fragments in the Orbitrap at a resolution of 50,000 at 200 Th. 
Further details on liquid-chromatography and mass-spectrometry 
parameters and settings were described recently“. 

Peptides were searched with aSEQUEST (v.28, rev. 12)-based software 
against a size-sorted forward and reverse database of the Mus musculus 
proteome (Uniprot 07/2014) with added common contaminant pro- 
teins. For this, spectra were first converted to mZXML. Searches were 
performed using a mass tolerance of 20 p.p.m. for precursors anda 
fragment-ion tolerance of 0.9 Da. For the searches, amaximum of two 
missed cleavages per peptide was allowed. We searched dynamically 
for oxidized methionine residues (+15.9949 Da) and, where indicated, 
for phospho-modification of serine, threonine and tyrosine residues 
(+79.9663 Da). We applied a target decoy database strategy, and set 
a false discovery rate (FDR) of 1% for peptide-spectrum matches fol- 
lowing filtering by linear discriminant analysis (LDA)”°*°. The FDR for 
final collapsed proteins was 1%. We calibrated MS! data post-search and 
performed searches again. We used a modified version of the Ascore 
algorithm to quantify the confidence assignment of phosphorylation 
sites. Phosphorylation localized to particular residues required Ascore 
values of more than 13 (P< 0.05) for confident localization”. Quanti- 
tative information on peptides was derived from MS? scans. Quant 
tables were generated requiring an MS? isolation specificity of more 
than 70% for each peptide and a sum of TMT signal-to-noise (s/n) ratios 
of more than 200 over all channels for any given peptide; the tables 
were exported to Excel and processed further therein. Details of the 
TMT intensity quantification method and further search parameters 
were described previously”. Proteomics raw data and search results 
were deposited in the PRIDE archive” and can be accessed under 
ProteomexXchange accession nunbers PXD014499, PXD014500 and 
PXDO14501. 

The relative summed TMT s/n ratios for proteins between two experi- 
mental conditions (referred to as ‘enriched’) were calculated fromthe 
sum of TMT s/n ratios for all peptides quantified of a given protein. 
For enrichment of Gene Ontology (GO) terms, the BINGO package in 
Cytoscape was used”. Scaled quantification data were subjected to 
two-way clustering (JMP software package) and changes in enrichment 
were analysed using Graphpad Prism 8 (Graphpad Software). Statistical 
significance was determined by multiple t-tests without correction for 
multiple comparisons and a= 0.05. Data for relative protein quantifica- 
tion can be found in Supplementary Tables 1, 3-6. 


Phosphoproteomic analysis of Rad 

HEK293T cells were cultured in DMEM with 10% FBS and 1% penicil- 
lin/streptomycin. Cells were transfected with GFP-labelled Rad using 
Lipofectamine 2000 as above. The medium was changed 4-6 h after 
transfection. After 24h, cells were treated with trypsin and spun down 
for 5 min at 1,000 r.p.m. Cells were then resuspended in PBS with 10 uM 
forskolin for 5 min. After washing cell pellets three times with PBS, cells 


were frozen at —80 °C. Cold cell pellets were lysed in PBS with 0.1% triton 
X-100 (v/v, Sigma) and a phosphatase-inhibitor mixture (Complete 
and PhosSTOP, both from Roche) by pipetting up and down several 
times. Lysates were homogenized by passing them through QlAshred- 
der cartridges (Qiagen) and incubated with GFP-trap agarose beads 
(Chromotek, Germany) for 4h at 4 °C with constant rotation. Beads 
were washed three times with PBS with 0.1% (v/v) Triton and three times 
with detergent-free PBS and subjected to on-bead digest with trypsin 
(Promega VS111), LysC (Wako) or ArgC (Promega V1881, ArgC digestion 
buffer 50 mM Tris-HCL pH 7.8, 5 mM CaCl,,2mM EDTA, 2% acetonitrile 
(v/v)) separately as described above overnight at 37 °C. After acidifica- 
tion, peptides were purified by reversed phase C,; chromatography and 
subjected to MS/MS analysis. For this, the same parameters as above 
for MS' and MS? scans were used with an isolation window of 1.2 Da 
and taking a neutral loss of 97.9763 Da into account, with multi-stage 
activation (MSA) set for MS’ scans. Analysis of phospho-site localiza- 
tion was performed as above. 


Fractional shortening 

Freshly isolated myocytes were perfused with a Tyrode’s solution con- 
taining 1.8 mM CaCl,. Myocytes were field stimulated at 1 Hz. Nisoldi- 
pine (300 nM) dissolved in Tyrode’s solution was then superfused. 
Fractional shortening of sarcomere length was measured using the 
SarcLen module of lonoptix. 


Whole-cell patch-clamp electrophysiology 

Isolated cardiomyocytes or HEK cells attached on glass 8 x 8 mm cov- 
erslips were placed in Bioptechs Delta T dishes filled with solution 
containing 112 mM NaCl, 5.4 mM KCI, 1.7 mM NaH,PO,, 1.6 mM MgCl, 
20.4mM HEPES pH 7.2, 30 mM taurine, 2 mM D/L-carnitine, 2.3 mM cre- 
atine and 5.4 mM glucose. The Petri dishes were mounted onthe stage 
of aninverted microscope and served as a perfusion chamber. After 
establishing a seal and achieving whole-cell configuration, external 
solutions were changed by the fast local perfusion method. 

Forcardiomyocytes, pipette resistance was between 1and3 MQ. Mem- 
brane currents were measured by the conventional (ruptured) whole- 
cell patch-clamp method using a MultiClamp 700B or Axopath200B 
amplifier and pCLAMP 10.7 software (Molecular Devices). Capacitance 
transients and series resistance were compensated. Voltage was cor- 
rected for liquid junction potential (-10 mV) during analysis. Leak 
currents were subtracted by a P/4 protocol. The parameters of voltage- 
dependent activation were obtained using a modified Boltzmann distri- 
bution: /(V) = Gyrax X (V- Exey)/[1 + exp(V— Vso)/V.)], where /(V) is the peak 
current, G,,,, is the maximal conductance, F,,., is the reversal potential, 
V9 is the midpoint, and V, is the slope factor. 

The pipette solution contained 40 mM CsCl, 80 mM caesium glu- 
conate, 10 mM1,2-bis(o-aminophenoxy)ethane-N,N,N’, N’-tetraacetic 
acid (BAPTA), 1mM MgCl, 4 mM Mg-ATP, 2 mM CaCl, and 10 mM 
4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES), adjusted 
to pH 7.2 with CsOH. After the isolated cardiomyocytes were dialysed 
and adequately buffered with 10 mM BAPTA in the internal solution, 
cells were superfused with 140 mM tetraethylammonium chloride 
(TEA-Cl), 1.8 mM CaCl,,1mM MgCl,, 10 mM glucose and 10 mM HEPES, 
adjusted to pH 7.4 with CsOH. To measure peak currents, we held the 
cell membrane potential at -SO mV and stepped it to +O mV for 350 ms 
every 10s. To evaluate the current-voltage (/- V) relationship in cardio- 
myocytes, we repeated the same protocol with steps between -40 mV 
and +60 mVin10-mV increments. Nisoldipine (Santa Cruz) was stored 
protected from light at -20 °C as3 mMstock in ethanol. The final dilu- 
tion of nisoldipine to 300 nM was inthe extracellular recording solution 
immediately before the experiment. Isoproterenol (Sigma 15627) and 
forskolin (Santa Cruz) were prepared daily and diluted in extracellular 
solution. 

For HEK293T cell experiments involving the forskolin-induced 
stimulation of Cay1.2, Cay1.3 and Cay2.2 currents, we implemented 
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the perforated whole-cell patch-clamp technique in order to minimize 
current run-downand preserve the intracellular milieu. Amphotericin B 
(Sigma A9528) was initially dissolved in DMSO (20 mg mI) and used in 
the pipette solution ata final concentration of 200 pg mI. Thetip of the 
pipette was filled with amphotericin-free solution containing 80 mM 
caesium gluconate, 40 mM CsCl, 10 mM HEPES, 10 mM BAPTA, 1 mM 
MgCl, and1mM Mg-aATP, pH adjusted to 7.2 with CsOH. The pipette was 
backfilled with 125 mM CsCl, 10 mM HEPES, 4 mM CaCl,, 1 mM MgCl, 
pH7.2 and CsOH containing amphotericin at 200 pg mI“. CaCl, (4 mM) 
was added to the patch electrode solution to enable the detection of 
conversion from perforated to ruptured configuration. The external 
solution contained 130 mM tetraethylammonium methanesulfonate, 
10 mM HEPES, 1 mM MgCl,, 10 mM (with Rad expression) or 2 mM 
(without Rad expression) BaCl,, 5 mM glucose. For experiments with 
HEK293T cells, in addition to step protocols, we used aramp protocol 
with a200-ms voltage ramp from-60 mV to +60 mV (0.6 Vs”) applied 
every 10 s to monitor the /-V relationship. All experiments were per- 
formed at room temperature, 22 +1 °C. Cells were selected on the basis 
of co-transfection of a vector containing GFP in the absence of Rad, or 
of a vector containing GFP-conjugated Rad. For both cardiomyocytes 
and HEK293T cells, cells without a stable baseline (possibly due to run- 
down or run-up) were not studied. 

The voltage-steps protocol used in cardiomyocytes studies evalu- 
ated Ipeak = Ipeax (V), which was recalculated in CLAMPFIT to G= G(V) 
as G=//(V-E,..,). For HEK cell experiments, we used a ramp protocol 
(Extended Data Fig. 8b). The experimental records were /=/(t), where 
t=time. We transformed these records to /=/(V), which was then fur- 
ther transformed to G = G(V) in CLAMPFIT (Extended Data Fig. 8b). 
The parameters of voltage-dependent activation were obtained using 
a modified Boltzmann distribution. A nonlinear fit with Boltzmann 
sigmoidal function was carried out in Prism (GraphPad), with a good- 
ness of fit (R’) of more than 0.99. 


Single-channel patch-clamp electrophysiology 

Cell-attached single-channel recordings were performed at room 
temperature as described”’”. Patch pipettes (5-10 MQ) were pulled 
from ultra-thick-walled borosilicate glass (BF200-116-10, Sutter Instru- 
ments), and coated with Sylgard. Currents were filtered at 2 kHz. The 
pipette solution contained 140 mM tetraethylammonium methane- 
sulfonate; 10 mM HEPES; 40 mM BaCL,; at 300 mOsm“, adjusted with 
tetraethylammonium methanesulfonate; and pH 7.4, adjusted with 
tetraethylammonium hydroxide. To maintain the membrane potential 
at O mV, the bath contained 132 mM potassium glutamate, 5 mM KCI, 
5mMNaCl,3 mM MgCl,,2mM EGTA, 10 mM glucose, 20 mM HEPES; at 
300 mOsmI!", adjusted with glucose; and pH 7.4, adjusted with sodium 
hydroxide. Cell-attached single-channel currents were measured dur- 
ing 200-ms voltage ramps between —80 mV and +70 mV (portions 
between —50 mV and +40 mV are displayed and analysed). For each 
patch, we recorded 80-120 sweeps with a repetition interval of 10s. 

The sample size for the P,-V relationships are as follows:0,¢, 2 =10 
(3 transfections), 933 sweeps; 0): plus PKA, n=5 (2transfections), 450 
sweeps; Rad, n=5 (3 transfections), 372 sweeps; Rad plus PKA, n=8 (3 
transfections), 527 sweeps; 4SA mutant Rad, n= 6 (4 transfections), 
388 sweeps; and 4SA mutant Rad plus PKA, n=5 (3 transfections), 439 
sweeps. 

The modal analyses are histograms of single-trial average P, values 
obtained from one-channel patches. For P,-V analysis, we conserva- 
tively corrected for shifts in voltage. The numbers of sweeps are as 
follows: a, alone: 281 sweeps (from three one-channel patches); Rad, 
372 sweeps (from five one-channel patches); and Rad plus PKA, 311 
sweeps (from three one-channel patches). 


Flow-cytometric FRET two-hybrid assay 
For flow-cytometric FRET assays, HEK293 cells (ATCC CRL1573) were 
cultured in 12-well plates and transfected with polyethylenimine (PEI) 


25 kDa linear polymer (Polysciences number 2396602). Briefly, 1.5 pg 
of cerulean (Cer)- and venus (Ven)-tagged cDNA pairs were mixed 
together in 100 pl of serum-free DMEM media and 6 ul of PEI was 
added to each sterile tube. FRET experiments were performed one 
day post-transfection. The protein-synthesis inhibitor cycloheximide 
(100 pM) was added to cells 2 h before experimentation to halt syn- 
thesis of new fluorophores, in order to allow existing fluorophores 
to fully mature. 

For FRET measurements, we used an LSR II (BD Biosciences) flow 
cytometer, equipped with 405-nm, 488-nm and 633-nm lasers for exci- 
tation and 18 different emission channels. Forward- and side-scatter 
signals were detected and used to gate for single and healthy cells. To 
determine FRET efficiency, we measured three distinct fluorescence 
signals: first, S,.. (corresponding to emission from the cerulean tag) is 
measured through the BV421 channel (excitation, 405 nm; emission, 
450/50); second, S\., (corresponding to emission from the venus tag) 
is measured via the FITC channel (excitation, 405 nm; dichroic, 5O5LP; 
emission, 525/50); and third, S-,-+ (corresponding to FRET emission) is 
measured via the BV510 channel (excitation, 405 nm; dichroic, 5OSLP; 
emission, 525/50). These raw fluorescence measurements are subse- 
quently used to obtain Vengi,e.. Venus emission due to direct excita- 
tion), Cefgirece (Cerulean emission due to direct excitation), and Ven;p-+ 
(venus emission due to FRET excitation). Flow-cytometric signals were 
collected at a medium flow rate (2,000 to 8,000 events per second). 
Fluorescence data were exported as FCS 3.0 files for further processing 
and analysis using custom MATLAB functions (MathWorks). 

For each experimental run on the flow cytometer, we performed 
several control experiments. First, the background fluorescence level 
for each fluorescent channel (BG,,,, BGye, and BG; p+) was obtained by 
measuring fluorescence from cells exposed to PEI without any fluoro- 
phore-containing plasmids. Second, cells expressing the Ven fluoro- 
phore alone were used to measure the spectral crosstalk parameter 
R,,, corresponding to bleed-through of Ven fluorescence into the FRET 
channel. Third, cells expressing the Cer fluorophore alone were used 
to measure spectral crosstalk parameters Rp, and Rp», corresponding 
to bleed-through of Cer fluorescence into the FRET and Ven channels 
respectively. Fourth, FRET measurements also require determination of 
instrument-specific calibration parameters 2\on/2cer AN fven/fcer Which 
are respectively ratios of fluorescence excitation and emission for Ven 
to Cer fluorophores. These parameters also incorporate fluorophore- 
dependent aspects, including molar extinction (for g) and quantum 
yield (for f), as well as instrument-specific parameters, including laser 
power, attenuation by optical components, and photodetection, ampli- 
fication and digitization of fluorescence. To determine these param- 
eters, we used Cer-Ven dimers with four different linker lengths (5, 32, 
40 and 228). Fifth, coexpression of Cer and Ven fluorophores provided 
estimates for concentration-dependent collisional FRET. 

In our experiments, R,, was approximately 0.11, Rj, approximately 
2.8, and R,, approximately 0.006. We observed only minor day-to-day 
variation in these measurements. For each cell, spectral cross-talk was 
subtracted as follows: Cer girect =Rp1 X Scer? VEN direct = Rar ¥ (Sven — Roo * Scer)s 
and Vengper = Spret ~ Rar % (Sven ~ Rp2 ¥ Scer) ~ Ror ¥ Scer- 

Following spectral unmixing, we obtained 2yen/Scer ANG fren! {cer from 
data for Cer—-Ven dimers by determining the slope and intercept for 
the following linear relationship: 


Venepet = Seer x VeNgirect pc Fen 
Cefgirect 8ven Cefgirect Seer 


For typical experiments, 2yen/Zcer = 0.0194 and fven/fcer = 2.3441. Hav- 
ing obtained these calibration values, we computed donor-centric 
FRET efficiencies as: 


Veneret 


Ep= 
fen 
Verirper + 3%, % C€ldirect 


For Cer-Ven dimers, we obtained FRET efficiencies of roughly 0.55, 
0.38, and 0.05 for linker lengths 5, 32, and 228 respectively, matching 
previous work”. The relative proportion of Cer and Ven fluorophores 
in each cell was determined as Neo, = Cel girect/ (1 - Ep) ANd Myen = VEN girect/ 
(Sven/Zcer *fver/fcer)« TO construct FRET two-hybrid-binding curves, we 
imposed a1:1 binding isotherm as in previous studies”. For each FRET 
pair, we obtained effective dissociation constant (K4 err), Ep max and 95% 
confidence intervals by constrained least-squares fit. 

For Fig. 4b, c and Extended Data Fig. 9a, b, we show all individual 
cells from two different transfections. We fit these data with the equa- 
tion Ep = Emax X [Ven — Rad ]ree/([Ven — Rad] ce + Ka err) using the least- 
squares algorithm (https://www.mathworks.com/help/stats/nlinfit. 
html). The Kg <r; fit value based on all of the data points is shown as 
the bar in Extended Data Fig. 9c. The error is calculated by the fitting 
algorithm as a 95% confidence interval on the fit parameter (https:// 
www.mathworks.com/help/stats/nIparci.html). 


Statistical analysis 

Experiments were not randomized. Results are presented as 
mean+s.e.m. For comparisons between two groups, we used Student’s 
t-test. Statistical analyses were performed using Prism 8 (Graphpad 
Software). For multiple group comparisons, we performed a one-way 
ANOVA followed by either Dunnett’s or Tukey’s post-hoc test using 
Prism 8. Differences were considered statistically significant at values 
of P<0.05. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data and material availability 


All transgenic mice are available from S.O.M. under a material agree- 
ment with Columbia University. All data are available in the main text, 
the Extended Data or the Supplementary Information. Proteomics 
raw data and search results were deposited in the PRIDE archive and 
can be accessed via the ProteomeXchange under accession numbers 
PXDO014499, PXDO14500 and PXDO14501. The FRET software is accessible 
on github at https://github.com/manubenjohny/FACS FRET. Source data 
for Figs. 1-4 and Extended Data Figs. 1, 2,6, 8 are provided with the paper. 


42. Katchman, A. et al. Proteolytic cleavage and PKA phosphorylation of a,, subunit are not 
required for adrenergic regulation of Cay1.2 in the heart. Proc. Natl Acad. Sci. USA 114, 
9194-9199 (2017). 

43. Yang, L. et al. The PDZ motif of the a1C subunit is not required for surface trafficking and 
adrenergic modulation of CaV1.2 channel in the heart. J. Biol. Chem. 290, 2166-2174 (2015). 

44. Sanbe, A. et al. Reengineering inducible cardiac-specific transgenesis with an attenuated 
myosin heavy chain promoter. Circ. Res. 92, 609-616 (2003). 

45. Hambleton, M. et al. Inducible and myocyte-specific inhibition of PKCa enhances cardiac 
contractility and protects against infarction-induced heart failure. Am. J. Physiol. Heart 
Circ. Physiol. 293, H3768-H3771 (2007). 

46. He,M., Bodi, |., Mikala, G. & Schwartz, A. Motif III S5 of L-type calcium channels is involved 
in the dihydropyridine binding site. A combined radioligand binding and 
electrophysiological study. J. Biol. Chem. 272, 2629-2633 (1997). 

47. Hockerman, G. H. et al. Construction of a high-affinity receptor site for dihydropyridine 
agonists and antagonists by single amino acid substitutions in a non-L-type Ca” channel. 
Proc. Natl Acad. Sci. USA 94, 14906-14911 (1997). 

48. Lam,S.S. et al. Directed evolution of APEX2 for electron microscopy and proximity 
labeling. Nat. Methods 12, 51-54 (2015). 

49. Bramson, H.N., Kaiser, E. T. & Mildvan, A. S. Mechanistic studies of cAMP-dependent 
protein kinase action. CRC Crit. Rev. Biochem. 15, 93-124 (1984). 

50. Kemp, B.E., Graves, D. J., Benjamini, E. & Krebs, E. G. Role of multiple basic residues in 
determining the substrate specificity of cyclic AMP-dependent protein kinase. J. Biol. 
Chem. 252, 4888-4894 (1977). 

51. Moore, M. J., Adams, J. A. & Taylor, S. S. Structural basis for peptide binding in protein 
kinase A. Role of glutamic acid 203 and tyrosine 204 in the peptide-positioning loop. 

J. Biol. Chem. 278, 10613-10618 (2003). 

52. Feramisco, J.R., Glass, D. B. & Krebs, E. G. Optimal spatial requirements for the location of 
basic residues in peptide substrates for the cyclic AMP-dependent protein kinase. J. Biol. 
Chem. 255, 4240-4245 (1980). 


53. Songyang, Z. et al. Use of an oriented peptide library to determine the optimal substrates 
of protein kinases. Curr. Biol. 4, 973-982 (1994). 

54. Neuberger, G., Schneider, G. & Eisenhaber, F. pkaPS: prediction of protein kinase A 
phosphorylation sites with the simplified kinase-substrate binding model. Biol. Direct 2, 1 
(2007). 

55. lakoucheva, L. M. et al. The importance of intrinsic disorder for protein phosphorylation. 
Nucleic Acids Res. 32, 1037-1049 (2004). 

56. Zhou, F.F., Xue, Y., Chen, G. L. & Yao, X. GPS: a novel group-based phosphorylation 
predicting and scoring method. Biochem. Biophys. Res. Commun. 325, 1443-1448 
(2004). 

57. Blom,N., Gammeltoft, S. & Brunak, S. Sequence and structure-based prediction of 
eukaryotic protein phosphorylation sites. J. Mol. Biol. 294, 1351-1362 (1999). 

58. Obenauer, J. C., Cantley, L. C. & Yaffe, M. B. Scansite 2.0: proteome-wide prediction of 
cell signaling interactions using short sequence motifs. Nucleic Acids Res. 31, 3635-3641 
(2003). 

59. Valencik, M. L. & McDonald, J. A. Codon optimization markedly improves doxycycline 
regulated gene expression in the mouse heart. Transgenic Res. 10, 269-275 (2001). 

60. Wan, E. et al. Aberrant sodium influx causes cardiomyopathy and atrial fibrillation in mice. 
J. Clin. Invest. 126, 112-122 (2016). 

61. Yang, L., Katchman, A., Morrow, J. P., Doshi, D. & Marx, S. O. Cardiac L-type calcium 
channel (Cav1.2) associates with gamma subunits. FASEB J. 25, 928-936 (2011). 

62. Yang, L. et al. Protein kinase C isoforms differentially phosphorylate Ca,1.2 a,,. 
Biochemistry 48, 6674-6683 (2009). 

63. Marx, S. O. et al. PKA phosphorylation dissociates FKBP12.6 from the calcium release 
channel (ryanodine receptor): defective regulation in failing hearts. Cell 101, 365-376 
(2000). 

64. Kalocsay, M. APEX peroxidase-catalyzed proximity labeling and multiplexed quantitative 
proteomics. Methods Mol. Biol. 2008, 41-55 (2019). 

65. McAlister, G. C. et al. MultiNotch MS3 enables accurate, sensitive, and multiplexed 
detection of differential expression across cancer cell line proteomes. Anal. Chem. 86, 
7150-7158 (2014). 

66. Beausoleil, S. A., Villén, J., Gerber, S. A., Rush, J. & Gygi, S. P. A probability-based 
approach for high-throughput protein phosphorylation analysis and site localization. Nat. 
Biotechnol. 24, 1285-1292 (2006). 

67. Paulo, J. A. et al. Quantitative mass spectrometry-based multiplexing compares the 
abundance of 5000 S. cerevisiae proteins across 10 carbon sources. J. Proteomics 148, 
85-93 (2016). 

68. Perez-Riverol, Y. et al. The PRIDE database and related tools and resources in 2019: 
improving support for quantification data. Nucleic Acids Res. 47 (D1), D442-D450 (2019). 

69. Deutsch, E. W. et al. The ProteomeXchange consortium in 2017: supporting the cultural 
change in proteomics public data deposition. Nucleic Acids Res. 45 (D1), D1100-D1106 
(2017). 

70. Maere, S., Heymans, K. & Kuiper, M. BINGO: a Cytoscape plugin to assess 
overrepresentation of gene ontology categories in biological networks. Bioinformatics 21, 
3448-3449 (2005). 

71. Tay, L.H. et al. Nanodomain Ca” of Ca?* channels detected by a tethered genetically 
encoded Ca” sensor. Nat. Commun. 3, 778 (2012). 

72. Adams, P. J., Ben-Johny, M., Dick, |. E., Inoue, T. & Yue, D. T. Apocalmodulin itself promotes 
ion channel opening and Ca” regulation. Cell 159, 608-622 (2014). 


Acknowledgements We thank A. Karlin for helpful discussions and editing the manuscript; 

B. Soda for creating the cell-model illustration; and G. A. Bradshaw for technical assistance. 
This publication was supported by the National Institutes of Health (NIH; grants RO1 HL113136, 
RO1 HL121253 and RO1 HL146149) and by the National Center for Advancing Translational 
Sciences (grant UL1TROO1873). These studies used the resources of the Herbert Irving 
Comprehensive Cancer Center Flow Cytometry Shared Resources, funded in part through 
Center Grant P30CA013696. Images were collected (and analysed) in the Confocal and 
Specialized Microscopy Shared Resource of the Herbert Irving Comprehensive Cancer Center 
at Columbia University, supported by NIH grant P30 CA013696 (National Cancer Institute). 
A.P. was supported by NIH grant T32 HL120826 and National Science Foundation (NSF) 
Division of Graduate Education (DGE) grant 1644869. D.R. was supported by grants T32 
HL120826 and F31 HL142178. J.K. was supported by grant T32 HLOO7343 and the Glorney- 
Raisbeck Fellowship from the New York Academy of Medicine, and J.A.H. was supported by 
grant T32 HLOO7854. The content is solely the responsibility of the authors and does not 
necessarily represent the official views of the NIH. 


Author contributions The following authors designed research and analysed data: G.L., A.P., 
A.N.K., S.1.Z., D.R., J.A.H., J.K., LY., B.-X.C., A.K, S.P.G., G.S.P., H.M.C., M.B.-J., M.K. and S.O.M. The 
following authors performed research and analysed data: G.L., A.P., A.N.K., S.I.Z., D.R., J.A.H., 
J.K., LY., B.-X.C., A.K, K.D., G.S.P., H.M.C., M.B.J., M.K. and S.O.M. The following authors wrote 
the paper with input from all authors: S.O.M., M.K., H.M.C., G.S.P. and M.B.J. All authors 
provided feedback and agreed on the final manuscript. 


Competing interests The authors declare no competing interests. 


Additional information 

Supplementary information is available for this paper at https://doi.org/10.1038/s41586-020- 
1947-z. 

Correspondence and requests for materials should be addressed to M.K. or S.O.M. 

Peer review information Nature thanks Donald Bers, Alice Ting and the other, anonymous, 
reviewer(s) for their contribution to the peer review of this work. 

Reprints and permissions information is available at http://www.nature.com/reprints. 


Article 


a Ae b 

Residue # (rabbit)| Sequence | 35- mutant lsoproterenol/ 

1 24 HLS HLA Forskolin - + - t+ - + - + 

2 31 |KGT KGA 

3 97 KLMGS KLMGA 

4 | 124/125/126 _|KKQGSTT_ | KKQGAAA 

5 149 RRACIS RRACIA 

6 495 RNMS RNMA 

7 |__528/530/533__ |RISKSKFS_|RIAKAKFA 

8 811/813 KKLARTAS |KKLARAAA 

9 840 KSIT KSIA 

10 893 RPLS RPLA 

11 993 KGS KGA 

12 1517 RDWS. RDWA 

13 1575 RLVS RLVA 

14 1604 RIKT RIKA 

15 1626/1627 _|KRTS KRAA 

16 1674 KPS KPA 

17 1680 RNALS RNALA 

18 1700/1704 __|RRAISGDLT|RRAIAGDLA| C o pWla 

19 1718 KEAVS KEAVA 

20 1738 HVS HVA 3.5 © 35-0 

21 1761 HIS HIA e ° ° WTB 

22 1781 KLVDST KLVDAA 2 3.0 © 28-8 

23 1826 HGS HGA a 0 000 © 35-0 x 28-6 

24 1842/43 KLSS KLAA + 257% 

25 1848 KRCHS KRCHA © 2.0] 9% %% 

26 1912 RLS RLA o PR% .° 25 

27 7920/22 |RSAS RAAA B 15] googete °2 00 © 

28 1928 RRAS RRAA = © D0Be % 82° o ae 

29 |_1972/74/76/77_|RSHSPSS__| RAHAPAA S$ 10 oe aM . 

30 2015/16 KLNSS KLNAA 3 _ — 

31 2024 HCGS HCGA Oo 05 

32 2036/37 RGDSS RGDAA ee 

33 2046/48 RPVSLT RPVALA 0 

34 2063 FHGS FHGA . T r T . T 7 

35 2120 RQS RQA 0 20 40 60 

Basal current density (pA/pF) 

1[- ae © GFP-tagged f 
2, 8 28-6 mutant 35-a mutant 
4 KSGGNA inna kDa 
5 
6{ 195 i 
7 KKT GFP-B amt 95 
— Ped _ Be 
9{ 268 RV 


KTA 
KIAA 
RGKA 
KAA 


478/479/480/481 
489/491 
496 
500 


Extended Data Fig. 1| Putative PKA phosphorylation sites in «,,and B,, 
subunits. a, Left column, the 35 putative PKA phosphorylation sites in rabbit 
Qc. Centre, the 51 residues in red are either predicted phosphorylation sites or 
within the immediate region of the predicted phosphorylation sites. Right, 

all 51 residues were replaced with alanine in 35-mutant a, transgenic mice. 

b, Combined bar and columnscatter plot of Boltzmann function parameters, 
V9. Data are mean +s.e.m. **P< 0.01; ***P< 0.001; ****P< 0.0001 by paired 
two-tailed t-test. pWT a, n=19; 35-a mutant, n=14; 28-B mutant, n=16; 35-0 
mutant x 28-8 mutant, n= 24. Specific Pvalues can be found inthe 

associated Source Data (see Supplementary Information). c, Graph showing 
isoproterenol- and forskolin-induced increases in nisoldipine-resistant 
current, stratified by total basal current density before nisoldipine treatment. 
d, Left, the 28 putative PKA phosphorylation sites in the N-terminal (NT), Hook, 
GK and C-terminal (CT) domains of B,,. Centre, the 37 residues in red are either 
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predicted phosphorylation sites or within the immediate vicinity of predicted 
phosphorylation sites, and were mutated to alanine in the 28-mutant GFP- 
tagged B.,, transgenic mice (right). e, Fluorescence imaging of isolated 
cardiomyocytes expressing the GFP-tagged 28-B mutant. Representative of 
images from more than five biologically independent mice. f, Anti-B-subunit 
immunoblot of cleared lysates from doxycycline-fed 35-mutant a, transgenic 
(TG) mice or 35-mutant a,, x GFP-tagged 28-mutant B,, expressing mice hearts. 
Representative of immunoblots obtained from at least three biologically 
independent mice. g, Anti-Flag antibody (upper) and anti-B antibody (lower) 
immunoblots of anti-Flag antibody immunoprecipitations from cleared lysates 
of hearts from pWT, 35-a and three mice expressing 35-a x GFP-tagged-28-B. 
Representative images from two independent experiments. For source gel 
data, see Supplementary Fig. 1. 
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Extended Data Fig. 2| Trafficking and function of APEX2-conjugated Cay1.2 
subunits in heart. a, Exemplar current-voltage relationship of Ca” currents 
from cardiomyocytes of a,--APEX2 mice, acquired in the absence (black trace) 
and presence (red trace) of nisoldipine. Insets show series of whole-cell Cay1.2 
currents recorded from aseries of voltage steps between -40 mV and +60 mV 
from a holding potential of -50 mV. Scale bars, horizontal 100 ms, vertical 

10 pA/pF. Representative of five experiments. b, Time course of changes in 
sarcomere length after superfusion of nisoldipine-containing solution. 
Representative of seven experiments. c, Percentage sarcomere shortening in 
the presence of nisoldipine. Data are mean +s.e.m. ****P< 0.0001 by unpaired 
two-tailed t-test. n=12 and 7 cardiomyocytes from left to right. NTG, 
nontransgenic. d, Immunofluorescence of cardiomyocytes isolated from mice 
expressing o,--APEX2 and B,,-APEX2, exposed to biotin-phenol and H,O, orno 
H,0,. Staining is with anti-V5 and Alexa594-conjugated secondary antibodies 
and streptavidin-conjugated Alexa488, and nuclear labelling is with DAPI. 
Scale bar, 5 um. Representative of 13 and 8 cardiomyocytes from 2 and 3 mice 
respectively. e, Streptavidin-HRP blot of lysates from isolated ventricular 
cardiomyocytes, representative of five similar experiments. f, Exemplar whole- 


cell Cay1.2 currents recorded from cardiomyocytes of a,--APEX2 transgenic 
mice. Black trace, 300 nM nisoldipine; blue trace, 200 nM isoproterenol plus 
nisoldipine. Representative of nine cells from two biologically independent 
mice. g, Asinf, except from B,,-APEX2 mice. Black trace, control; blue trace, 
200 nM isoproterenol. Representative of seven experiments from two 
biologically independent mice. h, i, Anti-phospho-phospholamban 
immunoblot of proteins from cardiomyocytes isolated from a,--APEX2 and 
B.,-APEX2 mice. Cardiomyocytes were exposed to either vehicle or 1M 
isoproterenol after incubation with biotin-phenol. Blots are representative of 
three independent experiments from at least five biologically independent 
mice for each genotype.j, Asin hand i, except that cardiomyocytes were 
isolated from non-transgenic mice without incubation with biotin-phenol. 
Blotis representative of three independent experiments from three 
biologically independent mice. k, As handi, except that whole heart was 
exposed to1pM isoproterenol for 5 min after infusion of biotin-phenol. This 
blot is representative of at least five biologically independent mice for no 
isoproterenol and at least five biologically independent mice for isoproterenol. 
For source gel data, see Supplementary Fig. 1. 
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Supplementary Table 1 shows all 3,883 proteins quantified by multiplexed SPS 
MS? TMT mass spectrometry. b, Prefuse force-directed map of proteins froma. 
Peptide counts were used as weight. Proteins mapping to the GOterm ‘Z disc’ 
are in green, to ‘membrane’ in yellow, and to both are in purple. a,-APEX2 and 
B.,-APEX2 are in blue. c, GO term (cellular localization) enrichment for 
proteins ina. See Supplementary Table 2 for the full table. 


Extended Data Fig. 3 | Analysis of proteins quantified by mass spectrometry 
in cardiomyocytes isolated from a,,-APEX2 and B,,-APEX2 mice. a, Proteins 
with aratio of more than 2 (measured by normalized TMT signal/noise) in the 
indicated experimental conditions, compared witha no-labelling control 

(no H,0,), were sorted by spectral counts. The 150 proteins with the highest 
peptide counts are displayed in this colour-coded table. a,--APEX2 and 
B.,-APEX2 data were collected in biological duplicate experiments. 
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Extended Data Fig. 4 | See next page for caption. 
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Extended Data Fig. 4 | Two-way hierarchical clustering of scaled data from 
Fig. 2.a, Dendrogram showing two-way hierarchical clustering of scaled TMT 
s/n data for streptavidin-purified proteins from a,.-APEX2 cardiomyocytes 
after stimulation with vehicle or isoproterenol. Shownare scaled relative TMT 
protein quantification data for 1,951 proteins from biological quintuplicate 
Q,--APEX2 mice. Clustering used Ward’s minimum-variance method. 

b, Dendrogram showing two-way hierarchical clustering of scaled relative 
quantification data for 1,936 proteins from biological triplicate B,,-APEX2 
experiments. Heterogeneity between cardiomyocyte preparations from 
different mice is apparent. c, Dendrogram showing two-way hierarchical 


clustering of scaled relative quantification data for 2,610 proteins from whole- 
organ a,--APEX without or with perfusion of isoproterenol. Prominent 
heterogeneity in relative protein quantification between hearts is apparent. 
The position of Rad is indicated by a red line. In this experiment, the individual 
hearts were not paired. d, Dendrogram showing two-way hierarchical 
clustering of scaled TMT s/n data from non-transgenic mouse cardiomyocytes 
stimulated with isoproterenol or with vehicle. Scaled data for 4,622 quantified 
proteins froma biological quadruplicate experiment are displayed. Pairing of 
samples is apparent. 
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Extended Data Fig. 5| See next page for caption. 


Article 


Extended Data Fig. 5 | Isoproterenol-induced change in Rad detected by 
mass spectrometry. a, Right, the MS? spectrum (top) and TMT quantification 
parameters (bottom) for aRad peptide changed upontreatment of murine 
hearts with isoproterenol. Shown is the MS’ spectrum that was used to identify 
the Rad peptide IFGGIEDGPEAEAAGHTYDR. Left, m/z ratios for band y ions 
identified inthe spectrum and their deviation from theoretical m/z ratios. 

We measured the precursor mass as 778.71 Da witha charge of +3. Peptide 
modifications were +229.16 Da for TMT onthe peptide N terminus and lysine 


residues, +57.02 Da for cysteine alkylation and +15.99 for methionine oxidation. 


Shownare ion injection times, isolation specificity, sum of signal-to-noise (SN) 
over all TMT channels, TMT raw intensities, adjusted intensities and final SN 
intensities used for relative quantification, as well as synchronous precursor 
selection (SPS) ion m/z ratios (isolated in the ion trap with Notch filtering; 


‘Notch mz’ denotes the ion m/zratio of individual isolated SPS ions prior to 
HCD fragmentation and MS’). b, Table showing gene names of proteins with 
P<0.05 for the three approaches: cardiomyocytes isolated from a,--APEX and 
Bop-APEX mice, and a,--APEX hearts. Genes in yellow are common toall groups, 
but note that for Mast2, the fold change is not consistent. Data are mean 

fold changes for five pairs of biologically independent pairs of a,.-APEX2 
cardiomyocyte samples, three pairs of biologically independent pairs of 
B.,-APEX cardiomyocyte samples, and ten o,--APEX2 hearts, five without 
isoproterenol and five with isoproterenol. Non-adjusted unpaired two-tailed 
t-test.c, Venn diagram showing the data from b. Rrad, Rad; Prkaca, PKA 
catalytic subunit; Acss1, acyl-CoA synthetase short-chain family member 1. Rad 
is the only protein that is consistently changed amongst the three approaches. 
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Extended Data Fig. 6 | Rad is required for forskolin-induced activation of 
heterologously expressed Cay1.2 channels. a, Exemplar whole-cell Cay1.2 
currents elicited from step depolarizations recorded from HEK293T cells 
expressing Rad. Voltage command steps were applied every 10 s before (black 
traces) and during (blue traces) forskolin treatment. Representative of at least 
tencells.b, Methodology used for generating G/Vcurves. i, Upper,a200-ms 
voltage ramp from -60 mV to +60 mV was applied every 10 s. Lower, current 
traces, each an average of three traces before (black) and three traces after 
(blue) forskolin treatment. ii, Conversion of time scale to applied voltage. 

iii, Conversion to G-Vrelationship. Fold change was calculated at G,,,,.¢, Graph 
showing forskolin-induced fold change in current, stratified by basal current 
density. d, Exemplar traces of Ba” currents in the absence and presence of Rad 
elicited by voltage ramp every 10s. Black traces, before forskolin treatment; 
blue traces after treatment; no Rad, 7 cells; Rad, 16 cells. e, Boltzmann function 


) 50 
Voltage (mV) 
parameter V;). Data are mean +s.e.m. **P< 0.01 by paired two-tailed t-test.n=7 
and 16, from left to right. f, g, Ratio of Ba” current after forskolin treatment to 
Ba” current before treatment for cells transfected without and with Rad. 
Representative of analyses for three cells for each condition. h-l, Distribution 
of sweep-by-sweep average P, (single-trial P,) for different conditions. h, Inthe 
absence of Rad, sweeps with no openings or blank sweeps are rare (10%); most 
sweeps exhibit either intermediate or high levels of openings. i, The fraction of 
blank sweeps is increased with expression of Rad.j, Ifthe PKA catalytic domain 
is also coexpressed with Rad, the fraction of blank sweeps is reduced and there 
is aresurgence of the high-activity mode. k, Pale blue lines show conditional 
P,-voltage relationships obtained for sweeps exhibiting high activity in the 
absence of Rad and PKA. The dark blue line is the Boltzmann fit. 1, As ink, but 
with Rad and PKA expression. 
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c Phosphorylation Site Overview 
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Extended Data Fig. 7| PKA phosphorylation sites in mouse Rad. a, Serine/ 
threonine residues (in purple) that are mutated to alanine in the 14-SA mutant. 
b, Mass-spectrometry identification of phosphorylated residues on Rad 
enriched with an anti-GFP nanobody matrix, from HEK cells expressing 
GFP-Rad and treated with forskolin. The number of spectral counts is plotted 
against the position of the phosphorylated amino acids in Rad. We detected 
534 aggregated phosphopeptides in two independent experiments. 

c, Database entry for phosphorylation sites identified previously in Rad 


272 

300 
(https://phosphomouse.hms.harvard.edu/site_view.php?ref=IPI00133102). 
The highest level of Rad phosphorylation was detected in the heart (left panel). 
The lower right panel shows peptides detected with phosphorylated serine 
residues on positions 25, 38 and 300 (in bold red; mapped to the Rad 
expression constructs used here; blue highlighting indicates sequence 
covered by peptides). d, Serine residues mutated to alanine in the 4-SA mutant 
(arrows). 


a Rad mutant 


MTLNGGSGASGSRGAGRERDRRRGSTPWGPAP PLHRRSMPV DERDLOAALAPGSLAT TAAGT RTOGORLD 
WPEGSSDSLSSGGSGSEEGV YKV LLLGAPGV GKSALARIFGGIEDGP EAEAAGHTYDRSITVDGEEASLL 
VYDIWEQDGGCWLPGHCMAMGDAYV IVYSITDKGSFEKASELRV OLRRAROTDDVPI ILVGNKSDLVRSR 


EVSVDEGRACAVVFDCKF IETSAALHHNV QALFEGVVROIRLRRDSKEDNARROAGTRRRES LGKKAKRF 
LGRIVARNSRKMAFRAKSKSCHDLSVL 


B., Mutant 
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YSAAHEDDV PV PGMAISFEAKDFLHV KEKFNNDWW IGRLVKEGCEIGFIPSPVKLENMRLOH EORAK 
FYSSKSGGNSSSSLGDIV PSSRKSTPPSSAID IDATGLDAEE NDIPANHRSP KPSAN SV TSPHSKEK 
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LLSRTLATSSLPLSPTLASNSOGSOGD ORTDRSAPIRSASQAEEEPSVEPV KKSOHRSSSSAPHHNH 
TSRGLSROETFDSETOQESRDSAYV EPKEDYSH DHV DHYASHR DHNHRDETHGSSDHRHRESRHRSRD 
EQDHNECNKQRSRHKSKDRYCEKDGEV ISKKRNEAGEWNRDV YIPQ 
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Extended Data Fig. 8 | Binding of Rad and B,, is required for regulation of paired two-tailed t-test. The data for wild-type Rad are the same as in Fig. 3h. 
forskolin-induced stimulation of voltage-gated Ca” channels. a, The Rad Specific P values can be found in the associated Source Data.n=16,19 and 
protein sequence shown here indicates the residues Arg208 and Leu235 that 13, from left to right. e, Fold changeinG,,,, in Ca,1.3 channels. Data are 
were substituted with alanine (yellow). b, The B,, protein sequence indicates mean +s.e.m.P<0.0001 by one-way ANOVA; ****P< 0.0001 by Dunnett’s test. 
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(yellow), resulting in attenuation of Rad binding to the B subunit, as described and 9 cells, from left to right. f, Fold change in G,,,, in Ca,2.2 channels. Data are 
previously*”’. c, Ba”* currents from Cay1.2 channels, elicited by voltage ramp mean +s.e.m. P< 0.001 by one-way ANOVA; ***P< 0.001 by Dunnett’s test. Data 
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d, Boltzmann function parameter V;,. Dataare mean +s.e.m. ***P<0.001by 
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Extended Data Fig. 9 | Phosphorylation-dependent dissociation of Rad, B, 
and B, subunits. a, b, FRET two-hybrid binding isotherms were determined for 
Cer-tagged B, (a) and B, (b) subunits, and N-terminal Ven-tagged wild-type 
(left) or 4-SA mutant (right) Rad. FRET efficiency (£,) is plotted against the free 
concentration Ven-WT or Ven-4SA-mutant Rad. The solid line fits a1/1 binding 
isotherm. Coexpression of the PKA catalytic subunit weakened binding in cells 
expressing wild-type Rad, but not in cells expressing 4-SA mutant Rad. c, Bar 


graphsummarizing mean K,4¢r¢ for B22, B; and B,, and wild-type and 4-SA mutant 
Rad, expressed without and with the PKA catalytic subunit. Dataare 

mean + 95% confidence intervals; error bars show 95% confidence intervals for 
the pooled nonlinear fits based onthe Jacobians computed. The sample size for 
each condition is 1,580-10,364 cells, acquired via two independent 
transfections and then pooled. The distribution of datainthis graphis 
reflected in Fig. 4b,c, andina, b. 
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Extended Data Fig. 10 | ClustalW alignment of Rad sequences and RGK 
GTPases. a, Conservation of phosphorylation sites from mouse Rad (Ser25, 
Ser38, Ser272 and Ser300) in other species. Blue highlights basic amino acids 
(arginine, lysine and histidine), and red highlights serine and threonine. 

b, C-terminal phosphorylation sites are conserved in other species. The 


a 
Ser272 


equivalent of the Ser25 phosphorylation site is conserved in Rem1, and the 
equivalent of Ser38 is probably conserved in Gem. Phosphorylation sites in 
mouse Rad (Ser25, Ser38, Ser272 and Ser300) are indicated with arrows. Blue 
highlights basic amino acids (arginine, lysine and histidine), and red highlights 


serine and threonine. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


x| The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


x A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


[x]|[__| A description of all covariates tested 


x A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


[x] A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
4 AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


z O For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


x For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 
x For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 
x Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection pClamp 10.7 software (Molecular Devices), Sequest (V.28, rev. 12) 


Data analysis Graphpad Prism 8, Origin 7.5, JMP 14 Pro, The FRET software is on github: https://github.com/manubenjohny/FACS_FRET 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 


All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


The authors declare that the data supporting the findings of this study are available within the paper, its supplemental figures and tables, and all raw mass 
spectrometry data have been deposited. 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Sample sizes exceeded the number of samples determined by power calculations, which were based on effect sizes previously reported in the 
literature. Number of animals was always greater than 3 per genotype. Sample size was determined to be adequate based on the 
magnitude and consistency of measurable differences between groups and low observed variability between samples. 


Data exclusions No data were excluded from analyses. For electrophysiological studies, cells were not studied if baseline electrophysiological determinations 
demonstrated current run-down. These cells were excluded before any experiment was initiated (a predetermined criteria). Once 
nisoldipine, isoproterenol or forskolin was added, the cells were included in all analyses. See Methods for details. 


Replication All experiments were replicated and all attempts at replication were successful and consistent. 


Randomization No randomization of mice. Mice were litter-mates when possible. 
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Blinding The investigators were blinded to group allocation during data acquisition and analysis. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
[_]|[¥] Antibodies [x]][_] ChiP-seq 
[x Eukaryotic cell lines x Flow cytometry 
x Palaeontology x MRI-based neuroimaging 


[x Animals and other organisms 


x Human research participants 


*]|[_] Clinical data 


Antibodies 


Antibodies used Additional information for all antibodies are provided in the Methods section. 
Anti-V5 Tag Monoclonal Antibody(Thermofisher, R960-25) 1:5000 dilution, 
Anti-Jph2 Antibody(Pierce, PAS-20642, lot# NG1583142) 1:1000 dilution, 
Anti-Calmodulin Antibody(Millipore Sigma, 05-173),1:1000 dilution, 
Anti-RyR2 Antibody(5029) dilute 1:5000, 
Anti-Kv1.5 Antibody(Alomone Labs, APC-004.Lot# APCOO4AN0850), 1:1000 dilution, 
Anti-phospho-phospholamban(Ser16/Thur17) Antibody(Cell signaling, #8496, Lot#1) 1:1000 dilution, 


Goat anti-Mouse IgG (H+L) Highly Cross-Adsorbed Secondary Antibody, Alexa Fluor 594 (Thermofisher, A11032, Lot#2069816) 
1:200 dilution, 


Streptavidin, Alexa FluorTM 488 Conjugate(Thermofisher, S32354, Lot# 1719656), 1:800 dilution, 
Streptavidin, horseradish peroxidase conjugate(Thermofisher, s911, Lot# 1711896), 0.6mg/ml, 
Anti-Alpha1C antibody (custom-made by YenZym) 1:1000 dilution 

Anti-CaVbeta antibody (custom-made by YenZym) 1:500 dilution 

Anti- FLAG antibody (Sigma, F7425, Lot#O78M4886V) 1:1000 dilution 


Validation These antibodies were used for both heterologuous expression studies and mouse cardiomyocytes. 
The alpha1C antibody was custom-made by YenZym and has been used for more than a decade in the laboratory. It has been 
validated using heterologous expression studies. 
The anti-beta antibody was also custom-made by YenZym and has been used for more than a decade in my laboratory. We have 
validated this antibody using heterologous expression studies. Antibodies were also validated in our previous work (Yang et al 
2019). 
The anti-jutophilin antibody and anti-FLAG antibody were validated using heterologous expression studies, by their manufacturer 
and in the case of anti-FLAG antibody using transgenic mice created in the laboratory. 
Anti-V5 Tag Monoclonal Antibody- validated by 310 published papers for WB and 70 published papers for Immunofluorescence. 
Anti-Jph2 Antibody- validated by Invitrogen and published paper. 
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Anti-Calmodulin Antibody- validated by Millipore Sigma and also multiple published papers. 
Anti-RyR2 Antibody - validated by multiple published papers and heterologous expression studies. 
Anti-Kv1.5 Antibody- validated by Alomone Laboratory and multiple published paper 


Anti-phospho-phospholamban(Ser16/Thur17) Antibody validated by Cell Signaling and multiple published papers 
Goat anti-Mouse IgG (H+L) Highly Cross-Adsorbed Secondary Antibody, Alexa Fluor 594 validated by Invitrogen. 
Streptavidin, Alexa FluorTM 488 Conjugate validated by Invitrogen and published papers. 

Streptavidin, horseradish peroxidase conjugate validated by Invitrogen and published papers 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) HEK293T cells (human, ATCC, CRL-3216) for electrophysiology; HEK293 (human, ATCC, CRL-1573) for FRET were purchased 
directly from ATCC. 

Authentication Cell lines are authenticated by ATCC. 

Mycoplasma contamination All cell lines were tested negative for mycoplasma 
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Commonly misidentified lines No commonly misidentified cell lines were used. 
(See ICLAC register) 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Mice, mixed strain, ages 6 weeks - 4 months, male and female. Description of research mice used for experiments can be found 
in the relevant figure legends and Methods. 


Wild animals The study did not involve wild animals 
Field-collected samples The study did not involve samples collected from the field 
Ethics oversight Columbia University IACUC 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Transcription challenges the integrity of replicating chromosomes by generating 
topological stress and conflicts with forks'”. The DNA topoisomerases Top] and Top2 
and the HMGB family protein Hmol assist DNA replication and transcription® °. Here 
we describe the topological architecture of genes in Saccharomyces cerevisiae during 
the Gland S phases of the cell cycle. We found under-wound DNA at gene boundaries 


and over-wound DNA within coding regions. This arrangement does not depend on 
Pol ll or S phase. Top2 and Hmol preserve negative supercoil at gene boundaries, 
while Top] acts at coding regions. Transcription generates RNA-DNA hybrids within 
coding regions, independently of fork orientation. During S phase, Hmol protects 
under-wound DNA from Top2, while Top2 confines Pol II and Top1 at coding units, 
counteracting transcription leakage and aberrant hybrids at gene boundaries. 
Negative supercoil at gene boundaries prevents supercoil diffusion and nucleosome 
repositioning at coding regions. DNA looping occurs at Top2 clusters. We propose 
that Hmol locks gene boundaries in a cruciform conformation and, with Top2, 
modulates the architecture of genes that retain the memory of the topological 
arrangements even when transcription is repressed. 


RNA polymerases generate positive and negative supercoils ahead 
and behind transcription bubbles, respectively’. Positive supercoiling 
accumulates in front of replication forks and precatenanes are gener- 
ated behind forks®*. TMP (4,5’,8-trimethylpsoralen) has been used to 
map DNA supercoiling” %, as intercalation of psoralen is proportional 
to negative superhelical tension”. Using biotinylated TMP (bTMP)”, we 
investigated the topology of transcribed genes and the contributions 
of Top1, Top2 and Hmol1to maintenance of the topological architecture 
of transcription units. 


Topological context of Pol II genes 


We analysed the distributions of Rpb3 (a Pol II subunit), Top2, Top1 and 
Hmol1inS phase and performed a meta-analysis on Pol II-transcribed 
genes (Fig. 1a). Rpbb3 accumulated at open reading frames (ORFs), peak- 
ing at transcription start sites (TSSs) and transcription termination 
sites (TTSs); this probably reflects slow transcription modes where 
transcription begins, and where transcription-coupled transactions 
occur at termination’. Top2 and Hmol accumulated upstream and 
downstream of ORFs. Top1 was confined within the ORFs, accumulat- 
ing close to TTSs. 

Using bTMP, we mapped negative and positive supercoil and stable 
regions” (Extended Data Fig. 1a, b). The three topological clusters were 
distributed near-equally (Extended Data Fig. 1c). Negative supercoiled 
regions were mainly found at intergenic regions (49%), whereas tran- 
scribed units exhibited a positive supercoiled context (40%; Fig. 1b, c). 
Nucleosome-occupied regions were distributed near-equally within the 


three topological territories (Extended Data Fig. Ic). Negative supercoil 
mirrored Top2 clusters (Fisher's exact test P< 1x10) and positive 
supercoil reflected Top] distribution (Fisher's exact test P< 110°). 
Gene topology was comparable in cells during G1 and S phase (Fig. 1b, c, 
Extended Data Fig. 1d, e). 

We analysed the topological profiles of conditionally expressed 
genes such as ASF2 (transcribed in S phase) and the galactose-inducible 
gene cluster (Extended Data Fig. 1f, g). We found comparable bTMP 
profiles at ASF2in G1 (repressing conditions) and S, and at the Gal genes 
in cells cultured with glucose (repressing conditions) and galactose. 
We analysed the locus containing the highly expressed LEU2 gene and 
the two moderately expressed NFS1 and DCCI genes (Extended Data 
Fig. 1h). Rpbb3 accumulated at LEU2 but was undetectable at NFSI and 
DCC1. However, Top1 accumulated at LEU2, as well as at NFS1 and DCC1. 
Top2 and Hmol were present at gene boundaries. Hence, the topologi- 
cal context of Pol II-transcribed genes does not depend on Pol II and 
the negative supercoil context at gene boundaries does not depend 
on Top2, as Top2 is recruited after G1 phase’®. 

We compared supercoil distribution among genes that showed high, 
medium or low expression (Extended Data Fig. 2a). Highly expressed 
genes accumulated more Top2 and negative supercoil, compared to 
the other two classes (Fig. 1d, e). Conversely, highly expressed genes 
exhibited less positive supercoil (Extended Data Fig. 2b). Accumulation 
of Pol Il and Top! mirrored the levels of expression (Fig. 1d), whereas 
distribution of Hmol was comparable in genes of all three levels of 
expression (Fig. 1d). Hence, distribution of under-wound DNA at Pol 
Il gene boundaries is enhanced in highly expressed genes. 


'IFOM (Fondazione Istituto FIRC di Oncologia Molecolare), Milan, Italy. *Medical Research Council Human Genetics Unit, Institute of Genetics and Molecular Medicine, University of Edinburgh, 
Edinburgh, UK. *Universita degli Studi di Milano, Milan, Italy. *e-mail: yathish.achar@ifom.eu; marco.foiani@ifom.eu 
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Fig. 1| Topological context of Pol II genes. Chromatin immunoprecipitation 
(ChIP)-on-chip was carried out in cells released from GlintoS phase. Pol II- 
coding regions (replicates n=2; meta-gene analysis n= 6,706 genes) were 
scaled to1kb and the flanking 0.5 kb from TSSs and TTSs were plotted against 
median intensity on the y-axis. a, Meta-gene plot showing accumulation of 

Pol II (Rpb3-10~ Flag), Top2 (Top2-10x Flag), Top1 (Top1-10x Flag) and Hmol1 
(Hmo1-10~ Flag). b, Meta-gene plot for negative supercoil in Gland S, plotted 
against median bTMP intensity. c, Positive supercoil distribution in Pol II genes 
plotted against average gene density onthe y-axis. d, Pol II genes were grouped 
into three categories; high, medium and low expression based on 

the fragments per million kilobases (FPKM) value from RNA sequencing 
carried out in S phase at 28 °C. Meta-gene plots for three categories of gene 
expression for Pol II, Top2, Top1and Hmol1.e, Negative supercoil distribution in 
high-, medium- and low-expression genes. f, Pol II genes were grouped 
according to their orientation with respect to neighbouring genes as: co- 
directional (+ strand; n=1,453 gene pairs), co-directional (— strand; n=1,415 
gene pairs), converging (n=1,590 gene pairs) and diverging (n=1,512 gene 
pairs). The numbers of gene pairs at different intergenic spaces (<250 bp=1,729 
gene pairs, 251-500 bp =2,224 gene pairs and >500 bp =2,010 gene pairs) were 
plotted with respect to their orientation. g, Base coverage percentage of 
supercoil accumulation at different intergenic spaces with respect to gene 
pairs grouped according to orientation (two replicates, mean +s.d.). 


We compared the supercoil context at intergenic spaces with respect 
to gene orientation by grouping Pol II genes into co-directional (plus 
and minus strands), converging and diverging classes (Fig. 1f). Inter- 
genic spaces between converging genes were smaller than in the other 
directional classes. Diverging genes exhibited larger intergenic spaces. 
Converging genes accumulated more positive supercoil at intergenic 
spaces, at the expense of negative supercoil (Fig. 1g). Accordingly, 
converging intergenic regions exhibited lower Top2 binding, whereas 
Top1 binding was not affected (Extended Data Fig. 2c). Hence, conver- 
gent and divergent transcription have imposed specific topological 
contexts at intergenic spaces. 
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Top2 and Hmol contribute to gene topology 


Temperature sensitive top2-1 mutants exhibited a reduction in negative 
supercoil at gene boundaries and an increase in positive supercoil at 
the same regions (Fig. 2a, Extended Data Fig. 2d, h). In G1 phase, top2-1 
mutants did not show areduction in negative supercoil (Extended Data 
Fig. 2e). top1A cells did not show changes in the topological context of 
gene boundaries or transcribed regions (Fig. 2a, Extended Data Fig. 2f, h). 
Like top2-1 mutants, top1Atop2-1 mutants exhibited a decrease in nega- 
tive supercoil at gene boundaries (Fig. 2a), although the accumulation 
of positive supercoil at the same regions was lower than in top2-1 cells 
(Extended Data Fig. 2g); top1Atop2-1 mutants accumulated less positive 
supercoil at transcribed regions than did wild-type cells, suggesting 
that, during transcription, the two topoisomerases can substitute for 
each other in maintaining a positive supercoiled context. Localiza- 
tion of Top] in wild-type cells was restricted to coding regions, but in 
top2-1 mutants it accumulated at gene boundaries (Fig. 2b, Extended 
Data Fig. 2i). Thus, Top2 restricts Top] at transcribed regions and top2 
mutants exhibit the unscheduled relocation of Top1 at gene boundaries, 
whichcan account for the local increase in positive supercoil. Moreover, 
in top2-1 mutants, Pol Il accumulated more at gene boundaries thanin 
wild-type cells (Fig. 2c, Extended Data Fig. 2i). Hence, Top2 confines 
the transcription apparatus within the coding regions. 

hmo!A cells exhibited a reduction in negative supercoil and accu- 
mulation of positive supercoil at gene boundaries, resembling top2-1 
mutants (Fig. 2d, Extended Data Fig. 3a, d). Ablation of Hmolin top2-1 
mutants restored a wild-type-like topological context at transcribed 
genes (Fig. 2d, Extended Data Fig. 3b, d). The distribution of Top1 in 
hmoiA and hmolAtop2-1 mutants was similar to that in wild-type cells 
(Extended Data Fig. 3c, e). Thus, the gene topological profiles of hmo1A 
and top2-1 are comparable, but hmo1A, unlike top2-1 mutation, does not 
cause accumulation of Top] at gene boundaries. It is possible that, in 
top2-1 mutants, accumulation of Top] at gene boundaries depends on 
DNA substrates generated by Hmol. 


Top2 restricts RNA-DNA hybrids within ORFs 


Using the SF9 antibody’”’, we investigated whether accumulation of 
RNA-DNA hybrids reflected a specific topological context. In wild- 
type cells, hybrids were distributed within ORFs and peaked at TTSs 
(Extended Data Fig. 4a). Their accumulation did not correlate with gene 
expression levels (Extended Data Fig. 4b). RNaseH and the Rrm3 and 
Senl helicases counteract hybrid accumulation”. Inrnh1A mutant cells, 
hybrids accumulated throughout Pol II gene units, whereas in rrm34 
and seni“ mutants hybrids accumulated at TTS sites (Extended Data 
Fig. 4c); this is consistent with the function of Rrm3 in dismantling RNA 
transcripts while travelling on the lagging strand” and with the role of 
Sen] in facilitating transcription termination”. These observations 
suggest that RNA-DNA hybrids represent a physiological intermediate 
during transcription and are confined within coding regions, and that 
their accumulation close to TTSs may reflect the slow-down of Pol II 
elongation at termination”. 

Intop2 mutants, specifically in S phase, hybrids accumulated at gene 
boundaries, where there is a reduction in negative supercoil (Fig. 2e). 
topItop2 double mutants resembled top2 mutants (Extended Data 
Fig. 4e). top1A cells accumulated hybrids throughout the gene bodies 
(Fig. 2e), perhaps owing to frequent Pol II pausing and back-tracking; 
because the viability of top1A cells depends on Top2, it is possible that 
top1A cells phenocopy a Top2 defect, leading to Pol II leakage and accu- 
mulation of hybrids at gene boundaries. Previous findings implicated 
Top] in preventing hybrid accumulation”. Hence, Top2 counteracts 
hybrid accumulation and, in top2 mutants, the accumulation of hybrids 
at flanking regions reflects the local decrease in negative supercoil and 
aberrant Pol II transcription. hmo1A cells exhibited a marked reduction 
inhybrid accumulation compared to wild-type cells, and hmo1Atop2-1 
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Fig. 2|Top2 and Hmol contribute to gene topology. G1 cells were released 

at 28 °C into S phase and temperature shifted to 37 °C for top2-1 mutants 
(replicates n=2; meta-gene analysis n= 6,706 genes). a, Meta-gene profiles for 
negative supercoil comparison in wild-type (WT), top2-1, top1A and top1Atop2-1 
cells. b,c, Meta-gene profiles comparing wild-type and top2-1 cells for 
accumulation of Top] protein (Top1-10x Flag) (b) or Pol II (Rpb3-10~ Flag) (c). 
d, Meta-gene profile comparison for negative supercoil in wild-type, hmo1A 
and hmolIAtop2-1 cells. e, Meta-gene profiles for RNA-DNA hybrid comparison 
in wild-type, top2-Jand top1A cells. f, Percentage of genes (G) and RNA-DNA 
hybrid (H) in either head-on or co-directional orientation with respect to 
replication forks. Genes were grouped with respect to their distance (0.25 kb, 
n=140 genes; 0.5kb,n=235 genes; 1kb, n=347 genes; 2 kb, n=539 genes; 5kb, 
n=1,121 genes) and direction (head-on or co-directional) from replication 
origin. 


mutants behaved similarly to hmo1A cells (Extended Data Fig. 4f, g). 
Hence, ablation of HMO1 also rescued the aberrant accumulation of 
hybrids at flanking regions in top2 mutants. 

We investigated whether a clash between forks and transcribed 
genes might influence the accumulation of hybrids, by analysing 173 
efficient replication origins”. Transcription units in a head-on or co- 
directional orientation with replication forks within 0.25, 0.5, 1, 2 or 
5 kb of the origin point were selected. There was a significant enrich- 
ment of transcribed genes oriented head-on with replication forks 
(Fig. 2f); this reflects the overlap between the signals that specify tran- 
scription termination and those that promote replication initiation”. 
However, the relative accumulation of hybrids in the head-on and the 
co-directional classes of genes were comparable (Fig. 2f, Extended Data 
Fig. 4h). Notably, the intergenic regions of converging genes were prone 
to accumulate hybrids, while this was not the case for the intergenic 
regions of divergent genes (Extended Data Fig. 4i). 


Negative supercoil affects gene architecture 


To validate the previous observations, we expressed Escherichia coli 
DNA topoisomerase I (TopA). TopA expression in top1top2 mutants 
depletes negative supercoil in plasmids”*. Wild-type and top1Atop2-1 
cells harbouring either control vector or TopA-expressing plasmids 
were analysed after 60 and 120 min at the restrictive temperature for 
top2-1 mutation (Fig. 3a, b). TopA expression in wild-type cells showed 
a reduction in negative supercoil at ORF-flanking regions and, in 
top1Atop2-1 cells, nearly abolished the negative supercoil at flanking 
regions (Fig. 3a, Extended Data Fig. 5a). Hence, the presence of Hmolat 
gene boundaries in top1Atop2-1 double mutants does not prevent TopA 
from resolving negative supercoil. TopA acts on negative supercoil to 
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Fig. 3 | Negative supercoil disruption causes disarray in nucleosome 
occupancy. Wild-type and top1Atop2-1 mutants either harbouring control 
plasmid or expressing E.coli DNA TopA plasmid were grownat 28 °C and shifted 
to 37 °C for 60 or 120 min to inactivate Top2 (replicates n=2; meta-gene 
analysis n= 6,706 genes). a, Meta-gene profiles for negative supercoil 
comparison in wild-type [Control plasmid], wild-type [TopAl, top1Atop2-1 
[Control] and top1Atop2-1[TopA] at 60 min and120 minat restrictive 
temperature. b, Meta-gene profile for positive supercoil accumulation. c, 
Meta-gene profiles of histone H3 in wild-type and top1Atop2-1 mutants with 
control or TopA plasmids plotted against median read coverage onthey-axis. 


convert it into positive supercoil’*. Accordingly, the disappearance 
of negative supercoil at flanking regions paralleled the progressive 
accumulation of over-wound DNA at the same location (Fig. 3b). 
Expression of TopA caused a reduction in positive supercoil at tran- 
scribed regions in wild-type cells, whereas it had the opposite effect 
in top1Atop2-1 mutants (Fig. 3b). This could result from the diffusion 
of supercoil waves across the entire gene bodies, perhaps owing to 
the destruction of topological or architectural confinements. Binding 
of Hmol was reduced in top1Atop2-1 mutants compared to wild-type 
cells and was nearly abolished in top1Atop2-1 cells expressing TopA 
(Extended Data Fig. 5b), indicating that association of Hmol with gene 
boundaries depends on negative supercoil. 

In wild-type cells, histone H3 was distributed at transcribed units 
but was less abundant at gene boundaries (Fig. 3c). top1top2 mutants 
resembled wild-type cells, suggesting that the aberrant topological 
context inthe double mutants did not affect the nucleosome context. 
Expression of TopA did not alter nucleosome positioning and distribu- 
tion in wild-type cells, but in top1top2 mutants it caused reduction of 
H3 distribution (Extended Data Fig. Sc). Moreover, H3 redistributed as 
its levels increased at flanking regions and, concomitantly, decreased 
at transcribed units, starting from position +2 (Fig. 3c, Extended Data 
Fig. 5d). Hence, expression of TopA caused an increase in positive 
supercoil followed by diffusion of supercoil waves across the entire 
gene body and massive nucleosome repositioning. 


Top2 mediates chromatin loop formation 


Using the chromatin interaction analysis by paired-end tag sequenc- 
ing (ChIA-PET)” method, we investigated whether Top2 mediates the 
formation of chromatin loops. We used Top2 as bait in S phase cells. 
Following DNA sequencing, we acquired one million independently 
mapped paired end tags (PETs) (Extended Data Fig. 6a, b) and, by keep- 
ing a1-kb minimum distance, we obtained 1,887 inter-ligation PET 
clusters (Extended Data Fig. 6b, c). The lengths of the Top2-mediated 
loops varied; some were larger than 10 kb (-100 interactions), while the 
majority of loops were between 1,500 and 2,000 bp in size witha median 
of 1,900 bp (Fig. 4a, b, Extended Data Fig. 6d). Sixty-four per cent of 
the interactions corresponded to previously described Top2-binding 
sites’ and, for the majority of the interactions (66%), Top2 was found 
only at oneend of the loop (Extended Data Fig. 6e). Overall, 45% of the 
Pol II genes were located within loops. Several loops were organized 
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Fig. 4 | Top2 mediates chromatin loop formation. a, Box plot showing loop 
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in clusters, with 31% of loops containing more than one gene and 51% 
of loops containing a single gene (Fig. 4b, Extended Data Fig. 6e, f). 


Discussion 


We have shown that Top1 localizes at coding regions. Top2 instead acts 
at negatively supercoiled gene boundaries and engages genes in loop- 
like structures, bringing promoters and terminators into proximity”®. 
Multiple twin topological domains are likely to be generated within 
the gene loops by waves of Pol II complexes”. Whereas in prokaryotes 
negative supercoil behind the first transcription bubble might adsorb 
the positive supercoil generated by the next approaching Pol II com- 
plex®, in eukaryotes, it may enable nucleosome assembly following 
Pol Il passage”. Hence, eukaryotic RNA polymerase progression might 
depend strongly on Top1in resolving topological stress in front of Pol 
II]. Accordingly, accumulation of Top] at coding regions depends on 
transcription levels. Coding regions exhibit a positive supercoiled 
context, even when transcription is repressed, implying that genes 
retain a ‘memory’ of a topological architecture that does not reflect 
the dynamics of elongating Pol II’*”. 

The negatively supercoiled regions that flank ORFs are refractory to 
nucleosome formation; TopA depletes negative supercoil specifically 
at these regions. Hence, gene boundaries exhibit an ideal topological 
context to ‘breathe out’ and undergo alternative structural transitions”. 
Nucleosome-free negatively supercoiled regions can form pseudo-cru- 
ciform structures™, characterized by two B-DNA duplex arms and two 
intra-strand plectonemic arms inanon-B-DNA conformation (Extended 
Data Fig. 7a). Such structures can branch-migrate, thereby modulating 
the extension of the intra-strand plectonemic duplexes*. Like other 
HMG box proteins®, Hmol binds four-way junctions with high affin- 
ity. Moreover, it stabilizes nucleosome-free regions, and dimerizes to 
promote DNA bridging”. We propose that Hmol locks cruciform DNA 
and thereby counteracts branch migration and nucleosome forma- 
tion. Stable negatively supercoiled gene boundaries in a cruciform 
conformation might help to insulate the topological architecture of 
gene loops to facilitate elongation of the multiple Pol II complexes, 
allowing efficient recycling of Pol II from TTS to TSS. Dimerization of 
Hmol” may promote gene looping, even in G1, without the mediation 
of Top2. Pol II movement and transcription-coupled processes, such as 
gene gating and/or splicing, might also contribute to gene looping by 
extruding portions of the transcribed DNA*. In S phase, Top2 would 
act at the loop base, probably to counteract the disruptive potential 
of incoming forks and/or to reset gene topology after fork passage 
(Extended Data Fig. 7b). Notably, Top2-dependent DNA loops can con- 
tain more than onetranscription unit, and can be organized in clusters, 
thus generating complex topological structures. 

S phase cells accumulate RNA-DNA hybrids at ORFs in about 45% of 
genes, independent of gene expression levels and of the direction of 
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transcription-replication. However, the intergenic regions of converg- 
ing genes exhibit a bias for hybrid accumulation. Pol II back-tracking 
during elongation, and a slow-down of Pol II during termination®”*°, 
could account for the formation of hybrids at ORFs under physiological 
conditions. The under-wound DNA behind Pol II can easily accommo- 
date RNA-DNA pairing”, and might even muffle the negative supercoil 
generated by Pol II movement. Our data suggest that hybrid formation 
is a physiological event, intrinsic to the topological dynamics generated 
by transcription and co-transcriptional processes. However, converging 
genes might generate the context for unscheduled genotoxic events, 
as in the case of CSR-activated B cells*”. 

Our model (Extended Data Fig. 7b) leads to the following predictions. 
(i) Recruitment of Hmol at gene boundaries would depend on their 
negative supercoil state; Hmol would then generate stable negative 
supercoiled cruciforms at gene boundaries (Extended Data Fig. 7a). 
Hmolis always found at negative supercoiled and nucleosome-free 
regions flanking ORFs, whereas Top2 is recruited in S phase. Counter- 
acting negative supercoil at gene boundaries prevents recruitment 
of Hmol. Without Hmol, cruciforms would be unstable but remain 
in anegative supercoiled state, becoming an ideal substrate for Top2 
(Extended Data Fig. 7c). Accordingly, inactivation of Top2 in hmo1 cells 
rescues negative supercoil at gene boundaries. (ii) In top2 mutants, 
negative supercoil decreases at ORF-flanking regions, probably owing 
to the unscheduled and massive recruitment of Top1 at gene bound- 
aries. Notably, non-B-DNA structures can be a substrate for Top1*?, 
and Top! can efficiently relax both positive and negative supercoil**. 
Hence, Hmol cannot protect cruciforms from Top] activity when top2 
is mutated, implying that, in top2 mutants, Top1 might cause genotoxic 
events at Hmol-locked cruciforms, such as extensive nicking and/or 
knotting*’ (Extended Data Fig. 7c). Notably, deletion of HMO1 in top2 
mutants, besides alleviating top2 temperature sensitivity’, prevents 
relocation of Top] at flanking regions, and hmoItop2 double mutants 
exhibit a wild-type-like topological context. (iii) InS phase top2mutants, 
Pol II leaks outside the canonical transcribed regions. This aberrant 
Pol II distribution is likely to reflect the inability of top2 mutants to 
recycle Pol lI from TTSs to TSSs, owing to the loss of proximity between 
promoters and terminators. In this view, Top2 might protect the gene 
loop structure from incoming forks. (iv) The aberrant Pol II distribu- 
tionin top2 mutants may also account for hybrid accumulation at gene 
boundaries. In top2 mutants, hybrid accumulation downstream of ORFs 
may result from aberrant transcription termination, while upstream 
of ORFs it might be facilitated by the Top1-mediated processing of 
cruciform DNA. In fact, top2top1 mutants exhibit fewer hybrids than 
single top2 mutants. Another possibility is that Top2 defects promote 
aberrant antisense transcription initiation events close to TSSs. 

The hybrids that accumulate at gene boundaries in top2 mutants may 
generate genotoxic events and the unscheduled synthesis of small RNA 
species, and might contribute to absorbance of the negative supercoil, 
thus implying that negative supercoil reduction at flanking regions may 
represent an indirect consequence of Top1 relocation. 

Our observations suggest that Top1, Top2 and Hmol contribute to the 
topological architecture of transcribed genes, particularly in S phase 
when forks reset the topological states of chromosomes and their chro- 
matin context. Interfering with the topological context of gene-flanking 
regions may cause a variety of pathological consequences, suchas the 
generation of aberrant RNA species, the accumulation of RNA-DNA 
hybrids and alterations at the level of chromatin architecture. 
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Methods 


Strains and growing conditions 

All S. cerevisiae strains are W303 derivatives**. The relevant genotypes 
are shown in Extended Data Table 1. Strains were grown at 28 °C in YPD 
medium. Gl synchronization was carried out using 3-5 pg/ml of « fac- 
tor. For S-phase samples, G1 cells were washed twice in YP medium and 
allowed to grow for 15 min in fresh medium. For temperature-sensitive 
strains, cells were allowed to grow for 10 min in fresh medium after 
G1 release, centrifuged and then dissolved in pre-warmed medium 
at 37 °C and allowed to grow for 15 min. Cell cycle progression into S 
phase was monitored by fluorescence-activated cell sorting (FACS) 
and budding profiles. For F£. coli TopA expression, wild-type cells and 
top1Atop2-1 mutants harbouring either control or TopA expression 
plasmids were grown at 25 °C in synthetic medium lacking leucine. 
Cells were shifted to 37 °C for inactivation of Top2 after reaching 8 x 10° 
cells/ml concentration. 


bTMP-ChIP 
We adapted the previously described method to yeast”. Sodium azide 
(0.1%) was used to block cells and to ensure the preservation of the most 
prevalent topological context present at any given genomic position. 
We note that this method does not aim to study dynamic topologi- 
cal transitions. Permeabilized yeast cells were incubated with bTMP 
(400 pg per 2 x 10° cells) in the dark for 90 min and then cross-linked 
by 365 nm UV (800 mJ/cm’) light to form adducts between two DNA 
strands. Cells were washed twice with ice-cold PBS and lysed in1 ml of 
lysis buffer (SO mM HEPES-KOH pH 7.5, 140 mM NaCl, 1mM EDTA, 1% 
Triton X-100, 0.1% Na-deoxycholate) using Zirconia beads. The cross- 
linked chromatin was sheared to an average size of 500 bp by 6 x 15-s 
pulses using a Biorupter sonicator and DNA was purified. Purified DNA 
was incubated with Dynabeads MyOne streptavidin (Invitrogen cat. no. 
65001) overnight at 4 °C. The beads were washed twice with each of the 
following buffers; wash buffer-I (20 mM Tris-HCL pH 8, 2 mM EDTA, 
150 mM NaCl, 1% Triton X, 0.1% SDS), wash buffer-II (20 mM Tris-HCL 
pH8,2mMEDTA, 500 mM NaCl, 1% Triton X, 0.1% SDS), wash buffer III 
(250 mM LiCl, 10 mM Tris pH 8.0, 0.5% Na-deoxycholate, 0.5% NP-40, 
1mM EDTA) and 1x TE (20 mM Tris pH 8.0, 2 mM EDTA). The bTMP- 
DNA complexes were eluted from the beads in 250 pl elution buffer 
(95% formamide, 10 mM EDTA) at 90 °C for 20 min and eluted samples 
were cleaned with a Qiagen PCR clean up kit. Input DNA was isolated 
from sheared chromatin input (1/100 of the material used for ChIP). For 
bTMP-ChIP with naked DNA, genomic DNA was isolated from Qiagen 
Genomic-tip 100/G (cat. no. 13343) and Genomic DNA Buffer Set (cat. 
no. 19060). Purified DNA was sheared to an average size of 500 bp by 
6x 15-s pulses using a Biorupter sonicator. bTMP was added to puri- 
fied DNA and incubated in the dark for 90 min and cross-linked with 
UV at 365 nm (800 mJ/cm?). DNA was precipitated using isopropanol 
and washed with 70% ethanol. The dried pellet was dissolved in buffer 
(50 mM Tris pH 8.0, 10 mM EDTA, 0.1% SDS) and incubated with Dyna- 
beads MyOne streptavidin (Invitrogen cat. no. 65001) overnight at 
4 °C. Washing and elution was as described above. Both IP and input 
samples were processed as described in the in microarray section. 
The procedure for bTMP titration is presented in Extended Data 
Fig. 8. bTMP binding normalization and the dispersion profile for bTMP 
are presented in Extended Data Fig. 9. 


Protein ChIP 

ChIP analysis for proteins was carried out as described” with few 
modifications. Cells were cross-linked with 1% formaldehyde in cul- 
ture medium for 30 min at room temperature followed by quenching 
with 0.125 M glycine for 5 min. Cells were washed twice with ice-cold 
PBS and lysed in 1 ml of lysis buffer (SO mM HEPES-KOH pH 7.5, 140 
mM NaCl, 1mM EDTA, 1% Triton X-100, 0.1% Na-deoxycholate) using 
Zirconia beads. Cross-linked chromatin was sheared to an average 


size of 500 bp by 6x 15-s pulses using a Biorupter sonicator. The lysate 
was then centrifuged to remove cell debris. The chromatin fraction 
was incubated with Dynabeads protein G beads (Invitrogen, cat. 
no. 10003D) coated with anti-Flag antibody (M2-antiflag, Sigma) over- 
night at 4 °C. Theimmune complexes were washed with the following 
buffers 2x; Chip-lysis buffer, high-salt lysis buffer (Chip-lysis buffer 
+360 mM NaCl), Chip-wash buffer (250 mM LiCl, 10 mM Tris pH 8.0, 
0.5% Na-deoxycholate, 0.5% NP-40, 1mM EDTA) and 1x TE(20 mM Tris 
pH 8.0, 2mM EDTA). The protein-DNA complexes were eluted from 
the beads in 250 ul elution buffer (1% SDS, 50 mM Tris pH 8.0, 10 mM 
EDTA) at 65 °C for 20 min followed by the addition of proteinase K to 
500 pg/ml and overnight incubation at 65 °C. Input DNA was isolated 
from sheared chromatin input (1/100 of the material used for ChIP). 
Both IP and input samples were processed as mentioned inthe section 
‘Microarray and data processing’. 


DRIP-ChIP 

DRIP-ChIP was performed using anti-DNA:RNA hybrid monoclonal 
mouse antibody S9.6 as previously described’. In brief, cells were 
cross-linked with 1% formaldehyde in culture medium for 20 min at 
room temperature followed by quenching with 0.125 M glycine for 
5 min. Cells were washed twice with ice-cold PBS and lysed in1 ml of lysis 
buffer (SO mM HEPES-KOH pH 7.5, 140 mM NaCl, 1 mM EDTA, 1% Triton 
X-100, 0.1% Na-deoxycholate) using Zirconia beads. Cross-linked chro- 
matin was sheared to an average size of 500 bp by 6x 15-s pulses using 
a Biorupter sonicator. The lysate was then centrifuged to remove cell 
debris. The chromatin fraction was incubated with Protein-A magnetic 
beads (Invitrogen, cat. no. 10001D) coated with anti-DNA:RNA hybrid 
$9.6 antibody” overnight at 4 °C. The immune complexes were washed 
with the following buffers 2x; Chip-lysis buffer, high-salt lysis buffer 
(Chip-lysis buffer + 360 mM NaCl), Chip-wash buffer (250 mM LiCl, 10 
mM Tris pH 8.0, 0.5% Na-deoxycholate, 0.5% NP-40, 1mM EDTA) and 1x 
TE (20 mM Tris pH 8.0, 2 mM EDTA). The RNA:DNA hybrid complexes 
were eluted from the beads in 250 ul elution buffer (1% SDS, 50 mM 
Tris pH 8.0, 10 mM EDTA) at 65 °C for 20 min followed by the addition 
of proteinase K to 500 pg/ml and overnight incubation at 65 °C. Input 
DNA was isolated from sheared chromatin input (1/100 of the material 
used for ChIP). Both IP and input samples were processed as mentioned 
in the section ‘Microarray and data processing’. 


Histone H3 ChIP sequencing 
ChIP analysis for proteins was carried out as described previously*. 
Inbrief, cells were cross-linked with 1% formaldehyde in culture medium 
for 15 min at room temperature followed by quenching with 0.125 M 
glycine for 5 min. Cells were washed twice with ice-cold PBS and lysed 
in1 ml of lysis buffer (SO mM HEPES-KOH pH 7.5, 140 mM NaCl, 1 mM 
EDTA, 1% Triton X-100, 0.1% Na-deoxycholate) using Zirconia beads. 
Cross-linked chromatin was sheared to an average size of 200 bp in 
Covaris S220 Focused Ultrasonicators. The lysate was then centri- 
fuged to remove cell debris. The chromatin fraction was incubated 
with Protein-G magnetic beads (Invitrogen, cat. no. 1OOO3D) coated 
with anti-histone H3 antibody (Abcam, cat. no. ab1791) overnight at 
4°C.The immune complexes were washed twice with the following 
buffers; ChIP-lysis buffer, high-salt lysis buffer (ChIP-lysis buffer + 
360 mM NaCl), ChIP-wash buffer (250 mM LiCl, 10 mM Tris pH 8.0, 
0.5% Na-deoxycholate, 0.5% NP-40, 1mM EDTA) and 1x TE(20 mM Tris 
pH 8.0, 2 mM EDTA). The protein-DNA complexes were eluted from 
the beads in 250 ul elution buffer (1% SDS, 50 mM Tris pH 8.0, 10 mM 
EDTA) at 65 °C for 20 min followed by the addition of proteinase K to 
500 pg/ml and overnight incubation at 65 °C. Input DNA was isolated 
from sheared chromatin input (1/100 of the material used for ChIP). 
For sequencing, IP and input ChIP sequencing (ChIP-seq) libraries 
were prepared according to the manufacturer’s protocols for the lon 
Proton sequencer (Thermo Fisher Scientific/Life Technologies). In brief, 
10 ng of ChIP DNA was end repaired and adaptor ligated using the KAPA 


Library Preparation Kit for lon Torrent (KAPA Biosystems) and adap- 
tor barcode Kapa Barcode Adaptors 9-24. After adaptor ligation, each 
sample was size selected using AMPure XP Bead (Beckman Coulter). 
An amplification reaction was set up ina final volume of 50 pl. A SPRI 
cleanup with a1.5x bead:DNA ratio was performed after amplifica- 
tion and final libraries were eluted in 35 pl. Libraries were quantified 
ona Qubit fluorometer with HS DNA (Thermo Fisher Scientific/Life 
Technologies) and checked for size on an Agilent Bioanalyzer with 
an HS DNA kit (Agilent). Each size-selected library was diluted toa final 
concentration of 11 pM and clonally amplified using the lon Proton 
Hi-QTemplate Kit (Thermo Fisher Scientific/Life Technologies) with 
lonOneTouch 2 instrument (Thermo Fisher Scientific/Life Technolo- 
gies). After emulsion PCR, DNA-positive ion sphere particles (ISPs) 
were recovered and enriched according to standard protocols with 
the lonOneTouch ES Instrument (Thermo Fisher Scientific/Life Tech- 
nologies). A sequencing primer was annealed to DNA-positive ISPs and 
the sequencing polymerase bound, before loading of ISPs into Ion P1 
sequencing chips. Sequencing of the samples was conducted accord- 
ing to the Ion Proton Hi-Q Sequencing Kit protocol. One P1 sequencing 
chip with six libraries was loaded and run onan Ion Proton sequencer. 


RNA sequencing 

Total RNA was isolated from 5 x 10’ cells with the RNeasy Mini Kit (50) 
(Qiagen cat. no. 74104). Prior to library preparation, cytoplasmic and 
mitochondrial ribosomal RNA was removed using the Ribo-Zero Gold 
rRNA Removal Kit (Yeast) (Illumina, cat. no. MRZY1324). Libraries for 
RNA sequencing were prepared according to the manufacturer’s pro- 
tocols for transcriptome sequencing with the lon Proton sequencer 
(Thermo Fisher Scientific/Life Technologies). In brief, 11g total RNA was 
poly-A-selected using the Dynabeads mRNA Direct Micro Purification 
kit (Thermo Fisher Scientific, cat. no. 61021) according to the manu- 
facturer’s protocol. About 50 ng of poly-A RNA was used to prepare 
strand-specific barcoded RNA libraries with the lon Total RNA-Seq kit 
v.2.0 (Thermo Fisher Scientific, cat. no. 4475936). In brief, poly-A RNA 
was fragmented with RNase III and purified with Nucleic Acid Binding 
Beads. After purification, the poly-A RNA fragments were hybridized 
and ligated with lon Adaptor and subsequently reverse transcribed for 
cDNA preparation. cDNAs were amplified with lon Torrent barcoded 
primers and purified with Nucleic Acid Binding Beads. Final libraries 
were quantified on a Qubit fluorometer with HS DNA (Thermo Fisher 
Scientific) and checked for size on an Agilent Bioanalyzer with an HS 
DNA kit (Agilent). Four barcoded libraries were pooled together onan 
equimolar basis at a final concentration of 11 pM and clonally amplified 
using the lon Proton Hi-QTemplate Kit (Thermo Fisher Scientific, cat 
no. A26434) with lonOneTouch 2 instrument (Thermo Fisher Scien- 
tific/Life Technologies). After emulsion PCR, DNA-positive ISPs were 
recovered and enriched by standard protocols with the lonOneTouch ES 
Instrument (Thermo Fisher Scientific/Life Technologies). A sequencing 
primer was annealed to DNA-positive ISPs and the sequencing poly- 
merase bound, before loading of ISPs into lon P1 sequencing chips. 
Sequencing of the samples was conducted according to the Ion Pro- 
ton Hi-Q Sequencing Kit (Thermo Fisher Scientific, cat. no. A26433) 
Protocol on Ion Proton instrument. 


ChIA-PET 

We adopted the previously described method”’. Cells were cross-linked 
with 1% formaldehyde in culture medium for 30 min at room tempera- 
ture followed by quenching with 0.125 M glycine for 5 min. Cells were 
washed twice with ice-cold PBS and lysed in1 ml of lysis buffer (50 mM 
HEPES- KOH pH 7.5, 140 mM NaCl, 1 mM EDTA, 1% Triton X-100, 0.1% 
Na-deoxycholate) using Zirconia beads. Cross-linked chromatin was 
sheared to an average size of 500 bp by 6x 15-s pulses using a Biorupter 
sonicator. The lysate was then centrifuged to remove cell debris. 
The chromatin fraction was incubated with Dynabeads protein G 
beads (Invitrogen, cat no 10003D) coated with anti-Flag antibody 


(M2-antiflag, Sigma) overnight at 4 °C. The immune complexes were 
washed twice with the following buffers: ChIP-lysis buffer, high-salt 
lysis buffer (ChIP-lysis buffer + 360 mM NaCl), ChIP-wash buffer 
(250 mM LiCl, 10 mM Tris pH 8.0, 0.5% Na-deoxycholate, 0.5% NP-40,1 
mMEDTA) and 1x TE (20 mM Tris pH8.0, 2mM EDTA). Beads were pooled 
and end-repair was carried out using T4 DNA polymerase (NEB, cat. 
no. M0203 L) by rotating on a Intelli-Mixer at 37 °C for 40 min. Beads 
were washed 3x with ice-cold ChIA-PET wash buffer (10 mM Tris pH 
7.4,1mM EDTA, 500 mM NaC)). A-tailing was carried out using Kle- 
now fragment (3’-5’ exo-) (NEB, cat. no. MO212M) in the presence of 
100 uM dATP by rotating ona Intelli-Mixer at 37 °C for 50 min. Beads 
were washed 3x withice cold ChIA-PET wash buffer (10 mM Tris pH 7.4, 1 
mMEDTA, 500 mM NaCl). For proximity ligation, a bridge linker was pre- 
pared by annealing Linker-F and Linker-R (HPLC purified (250 nmole) 
from IDT (Integrated DNA Technologies): bridge linker-F: 5’- /SPhos/ 
CGCGATATC/iBIOdT/TATCTGACT -3’; bridge linker-R: 5’- /SPhos/GTCA- 
GATAAGATATCGCGCT -3’. 

Proximity ligation was carried out using T4 DNA ligase (NEB cat. 
no. MO202M), in the presence of bridge linker at a concentration of 
0.57 ng/pl by rotating at 16 °C overnight. Beads were washed once 
in ChIA-PET wash buffer and eluted in elution buffer (10 mM Tris pH 
8.0,1mM EDTA, 1% SDS) at 65 °C for 15 min. Reverse cross-linking was 
carried out at 65 °C in the presence of proteinase K. DNA was puri- 
fied using phenol-chloroform-isoamyl alcohol (pH 7.9) and MaxXtract 
High Density-2ml (QIAGEN, cat. no. 129056) and precipitated with 
isopropanol. Tagmentation of proximity-ligated DNA was carried out 
by Tn5 transposome using the Nextera DNA Sample Preparation Kit 
(24) (Illumina FC-121-1030). Tagmentated DNA was purified using the 
Zymo Genomic DNA Clean & Concentrator kit (Zymo Research, cat. 
no. D4014) and fragments containing linker DNA were enriched with 
Dynabeads M-280 Streptavidin (Invitrogen, cat. no.11205D). Beads 
were washed with 2 x SSC/0.5% (wt/vol) SDS five times and twice in 1x 
B&W buffer (10 mM Tris pH 8.0, 1 mMMEDTA, 1M NaCl). The sequencing 
library was amplified using beads and the purified library was used for 
paired-end sequencing using MiSeq Reagent Kit v3 (600 cycle) onan 
Illumina MiSeq instrument. 


Microarray and data processing 

Both IP and input DNA were amplified using the GenomePlex com- 
plete whole-genome amplification kit (Sigma, cat. no. WGA1-SORXN), 
biotin-labelled and hybridized to Affymetrix GeneChip S. cerevisiae 
Tiling 1.0R Array (ScO3b_MR) according to the Affymetrix standard 
protocol. The CEL files were processed using rMAT” R package to iden- 
tify enriched regions across the genome. At first, systematic biases 
such as probe effect were corrected by normalization. Then probe 
intensities were smoothed and a score was calculated for each probe 
using IP and input. To detect enriched regions based on the probe 
score, the following parameters were used; dMax = 300 (sliding win- 
dow side), nProbesMin = 8 (minimum number of probes to average), 
method = Score (calling enriched regions based on sliding widow 
scores), log, threshold = 1.5 (equal and greater than 1.5 are labelled 
as enriched regions). For bTMP experiments, bTMP binding ‘in cells’ 
(IP/input) was subtracted from the ‘naked genomic DNA’ score 
(IP/input) to correct for false positive binding of bTMP. 


Meta-gene analysis 

Meta-gene analysis was used to study the averaged enriched peak 
profile across all protein-coding genes (6,706 genes from SacCer 
2011 annotation) or aspecific set of genes upstream (-500 bases from 
TSS) and downstream (+500 bases from TTS) in the yeast genome. The 
peak scores were mapped using bedtools”™, for every base of the gene 
including upstream (-500 b) and downstream (+500b). The length of 
the gene was scaled to 1,000 bases. For scaling the ORF region to 1,000 
bases, the following equation was iterated for every base across all the 
genes (Z-x;)/(y;-x;)) x1,000 where x;is the start position of the ith gene, 
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i=(1,2,3... total genes), y,is the end position of the ith gene, i=(1,2,3... 
total genes), Zis base position (1, 2, 3....y,-x,). For the average intensity 
plot, the IP/input values of the normalized position (1,000 ORF, -500 
upstream and +500 downstream) of each gene were aggregated using 
median. For average gene density, the IP/input score was converted 
into a categorical value of either 1 or O based on the threshold of 1.5 
(21.5 is land <1.5 is O) of all the normalized positions (1,000 ORF, -500 
upstream and +500 downstream) of each gene and aggregated using 
thesum function. For visualization, average intensity and average gene 
density were plotted with respect to normalized ORF position. The 
points were smoothed using the generalized additive model (GAM) 
to obtain a curve using ggplot2 R package. 


RNA-seq data processing 

The RNA-seq data from the lonTorrent proton instrument contains 
approximately 25 million reads for each sample. The raw reads were 
filtered on the basis of quality value (-q 20 and —p 30) using the FASTX 
Toolkit. The filtered reads were aligned to the reference genome (Sac- 
Cer 2011) using STAR aligner™. Aligned BAM files were used for tran- 
script quantification (FPKM) using RSEM~”. The gene sets were divided 
into three equal categories (low, medium and high expression) accord- 
ing to FPKM values and used to plot the supercoiling, protein and RNA- 
DNA hybrid profile using the meta-gene calculation mentioned above. 


Histone H3 ChIP-seq data processing 

The ChIP-seq data from the IonTorrent proton instrument contain 
approximately 15 million reads for each sample. The raw reads were 
filtered on the basis of quality value (-q 20 and —p 30) using the FASTX 
Toolkit. The filtered reads were aligned to the reference genome 
(SacCer 2011) using TMAP aligner. The PCR duplicates were removed 
from the aligned BAM files using PICARD tools. The BAM files were 
sorted and indexed for the peak calling using SAMtools. The bedgraph 
files were generated by comparing bam files of IP and input (IP read 
coverage/input read coverage) resulting in a ratio for every base across 
the whole genome using deep Tools (bamCompare)”. Finally, peak call- 
ing was performed using the DANPOS (dpos) toolkit™ with the IP/input 
threshold 1.4 (-q1.4) where the output peaks corresponds to the indi- 
vidual nucleosome. The DANPOS was preferred over the MACS toolkit 
for the dynamic nucleosome analysis at single-nucleotide resolution. 


ChIA-PET data processing 

ChIA-PET data contain approximately 10 million reads with a median 
length of approximately 105 nucleotides. Raw reads were filtered on 
the basis of quality value (—q 20 and —p 30) using the FASTX Toolkit. 
The filtered reads were scanned for bridge linker (ACGCGATATCT- 
TATCTGACT, AGTCAGATAAGATATCGCGT) with a maximum of two 
mismatches using cutadapt. The reads containing the bridge linker 
were aligned to the reference genome (SacCer 2011) using the bwa mem 
module. PCR duplicates were removed using Picard Markduplicates 
module. The aligned bam file was converted to a bed pair end interac- 
tion file (bedpe) for cluster generation using bedtools (bamtobed) 
module. PETs with less than 1 kb distance (self-ligation loops) were not 
considered for the PET clustering. Individual PET interactions were 
clustered by extending each PET by 500 bp and PETs that overlapped 
at both ends were clustered together as a single PET cluster””. PET 
clusters with more than or equal to 2 were considered for meta-analysis. 
WashuU Epigenome Browser was used to visualize chromatin-chromatin 
interactions®. 


Tool kits 

FASTX Toolkit: http://hannonlab.cshl.edu/fastx_toolkit/ 
TMAP Toolkit: https://github.com/iontorrent/TMAP 
PICARD Toolkit: https://broadinstitute.github.io/picard/ 
BWA Toolkit: http://bio-bwa.sourceforge.net/ 


Statistics and reproducibility 

All experiments were carried out with two biological replicates. To test 
the significance of the overlap between two replicates (supercoiling, 
protein and hybrid peak calls), intersect and Fisher's exact test from 
bedtools were used. For bedtools intersect, aminimum of 80% overlap 
was expected for further downstream analysis such as meta-gene plot- 
ting. The number of overlap peaks and sum of overlap bases between 
two sets of intervals from bedtools were visualized using VennDiagram 
library from R. Protein-coding genes (n= 6,706) from SacCer 2011 were 
used for meta-gene plotting. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


All raw and processed data are available at the Gene Expression 
Omnibus (GEO) under the following accession numbers: GSE114410 
(bTMP, RNA-DNA hybrids, Top1 protein ChIP-on-chip and RPB3 pro- 
tein ChIP-on-chip); GSE114444 (RNA-seq, H3 ChIP-seq and ChIA-PET); 
GSE16258” (Top2 protein ChIP-chip, Hmol protein ChIP-chip and RPB3 
protein ChIP-chip). 


Code availability 


All the custom-made scripts used for this study are available in the 
GitHub repository at https://github.com/adhilmd/TopologyCusto- 
mAnalysis. 
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Extended Data Fig. 1| See next page for caption. 
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Extended Data Fig. 1| DNAsupercoil accumulation across the genome in 
wild-type conditions. a, Schematic representation of bTMP binding to DNA in 
asupercoil-dependent manner. Based on normalized bTMP (IP/input) values, 
the genomic region is categorized into negative, positive and stable regions. 
b, Genome browser profile for bTMP binding on chromosome III. On the basis 
of peak intensities, bTMP peaks were called for negative (red), stable (blue) and 
positive (green) regions. Right, expanded view of chromosome Ill from 15 to 
25kb. For the bTMP profile, positive value peaks (normalized IP/input) that 
were above the threshold (+1.5) were designated as ‘negative supercoil’ (-o), 
and negative value peaks (normalized IP/input) that were below the threshold 
(-1.5) were designated as ‘positive supercoil’ (+o). Peaks in between the 
thresholds (from -1.5 to +1.5) were considered stable regions. c, Pie charts 
showing the coverage of negative, positive and stable regions based on bTMP- 
ChIP values plotted as percentage coverage for whole genome, intergenic 


regions, protein-coding regions and nucleosome-occupied regions. d, Gene 
correlation plot for negative supercoil (percentage) accumulation for wild- 
type cells in Gland S phase (n= 6,706 genes; two-sided paired t-test, P< 2x10; 
Pearson correlation r=0.86).e, Venn diagram comparison of Gland S phase for 
bTMP binding with respect to peak number and base coverage. f, Genome 
browser view of the ASF2locus on chromosome IV, showing bTMP peaks and 
accumulation of Pol II (Rpb3-ChIP) in Gland S phase. g, Gal genes in 
chromosome ll (from 270 to 285 kb), depicting bT MP and Pol II (Rpb3-ChIP) 
binding profiles under glucose and galactose conditions in S phase. 

h, Expanded view of chromosome III from 90 to 96 kb, containing a highly 
active gene (LEU2) close to tRNA (tL(CAA)C) and two moderately expressed 
genes (NFS1 and DCC]1). Positive and negative supercoil and accumulation 

of Pol II (Rpb3-ChIP), Top1, Top2 and Hmolare shown. 
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Extended Data Fig. 2|See next page for caption. 
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Extended Data Fig. 2| Negative and positive supercoil distributionin DNA 
topoisomerase mutants. a, Pol II-coding genes were grouped into three 


categories; high, medium and low expression based on FPKM values from RNA- 


seq data (n=6,706 genes; low = medium = high = 2,235 genes). b, Positive 
supercoil distribution in high-, medium- and low-expression genes. c, Base 
coverage percentage of accumulation of Top2 and Topl accumulation at 
different intergenic spaces (<250 bp =1,729 gene pairs, 251-500 bp =2,224 
gene pairs and >500 bp =2,010 gene pairs) with respect to gene pairs grouped 
according to orientation. d, Accumulation of positive supercoil in wild-type 


and top2-1cellsinS phase. Pol II genes are plotted against average gene density 
onthe y-axis. e, Meta-gene plot for negative supercoil in G1 synchronized wild- 
type and top2-1 cells. f, Accumulation of positive supercoil in wild-type and 
top1A cells inS phase. g, Accumulation of positive supercoil in wild-type and 
top1Atop2-1 cells inS phase. h, Genome browser profile of chromosome Ill from 
90 to 96kb, showing comparative bTMP binding in wild-type cells and 
topoisomerase mutants. i, Genome browser profile of chromosome III from 90 
to 96 kb, showing Pol II (Rpb3-ChIP) and Top! protein accumulation in wild- 
type cells and top2-1 mutants. 
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Extended Data Fig. 3 | Negative and positive supercoil accumulationinhmol 
mutants. a, Positive supercoil accumulation in wild-type and hmo1A cellsinS 
phase. Pol II genes are plotted against average gene density on the y-axis. 

b, Positive supercoil accumulation in wild-type and hmoJAtop2-1cellsinS 
phase.c, Comparison of the accumulation of Top] protein (Top1-10~ Flag) in 
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wild-type, hmoIA and hmolAtop2-1 cells. d, Genome browser profile of 
chromosomelll from 90 to 96 kb, showing comparative bTMP binding in 
wild-type, hmoIA and hmolAtop2-1 cells inS phase. e, Genome browser profile 
of chromosome III from 90 to 96 kb, showing accumulation of Top1in wild- 
type, hmolIA, top2-1and hmol1Atop2-1 mutants. 
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Extended Data Fig. 4 | See next page for caption. 
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Extended Data Fig. 4 | RNA-DNA hybrid accumulation in wild-type and 
mutant cells. a, Meta-gene profiles for RNA-DNA hybrid comparison in wild- 
type cells (at 28 °C and 37 °C). b, Accumulation of RNA-DNA hybrids in different 
expression classes. Pol Il genes were grouped into three categories: high-, 
medium- and low-expression genes based on FPKM values from RNA-seq 
(Extended Data Fig. 2a). c, Meta-gene profiles for RNA-DNA hybrids in S phase 
in wild-type, rnh1A, rrm3A and sen!“ cells (a conditional lethal strain GAL::URL- 
HA-Sen1, which shows lethality in glucose). d, Meta-gene profiles for RNA-DNA 
hybrids in Glsynchronized wild-type and top2-1 cells. e, Meta-gene profile for 
RNA-DNA hybrid comparison in wild-type cells and top1Atop2-1 double- 


mutant cells. f, Meta-gene profiles for RNA-DNA hybrid comparison in wild- 
type, hmoIA, and hmoIAtop2-1 cells. g, Gene density plot comparison of RNA- 
DNA hybrids in wild-type, hmoIA and hmolAtop2-1 cells. h, Density plot 
showing the base coverage of RNA-DNA hybrids in genes with head-on or co- 
directional orientation with respect to replication fork. Genes within 1kb (top, 
n=347 genes) or 2 kb (bottom, n=539 genes) of the replication origins were 
considered. i, Base coverage percentage of accumulation of RNA-DNA hybrids 
at different intergenic spaces (<250 bp =1,729 gene pairs, 251-500 bp =2,224 
gene pairs and >500 bp =2,010 gene pairs) with respect to gene pairs grouped 
according to orientation. 
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Extended Data Fig. 5 | Ablation of negative supercoil at boundaries causes 
topological alterations and nucleosome repositioning. a, Genome browser 
profile of chromosome III from 90 to 96 kb, showing comparative bTMP 
binding in wild-type[Control], wild type[TopA], top1Atop2-1 [Control] and 
top1Atop2-1[TopA] cells after 120 min at restrictive temperature. b, Genome 
browser profile of chromosome V showing Hmol accumulation in wild-type, 
top1Atop2-1 [Control], and top1Atop2-1[TopA] cells after 120 min at restrictive 
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Nucleosome position (150bp intervals) with respect to TSS 


temperature. c, Absolute nucleosome count was derived from histone H3 
ChIP-seq analysis of wild-type [Control], wild type [TopA], top1Atop2-1 
[Control] and top1Atop2-1[TopA] cells after 120 min at restrictive temperature. 
d, Nucleosome positions (150-bp intervals) in wild-type cells and top1Atop2-1 
double mutants harbouring either control or TopA-expressing plasmids 
mapped with respect to TSS against Pol II gene percentage on the y-axis. 
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Extended Data Fig. 6 | Top2 mediates chromatin-chromatin interactions. number of interactions matching with previous Top2-binding sites, number 
a, Heat map showing PET clusters inall of the chromosomesat10-kbresolution. of interactions containing Top2 sites on either single or both sides of the 
Inset, enlarged view of chromosome II. b, Table showing the number of paired- interaction, number of genes within and outside the Top2-mediated chromatin 
end reads, reads with linker on both ends, unique mapped PETs and inter- loops and number of loops containing a single gene or multiple genes. Other 
ligation clusters obtained after Top2 ChIA-PET analysis. c, Density map categories include genes partially overlapping with the loops and all other 
showing PET density at different interactionlengths. Inletshowsenlargedview — elements. f, Genome browser view of ChIA-PET interactions along with 
of the region from 1,000 to 10,000 bp that was considered for analysis. d, Bar Top2- and Hmotl-binding sites inchromosomes Ill and XVI. 
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Extended Data Fig. 7 | See next page for caption. 


Extended Data Fig. 7 | A model for Hmol, Top2 and Top1-mediated 
topological dynamics at Pol II genes. a, Schematic representation of 
cruciform DNA structures arising at negatively supercoiled DNA formed by two 
branchesinaB-DNA duplex structure (red and black) and two branches ina 
non-B-DNA duplex conformation (red or black). Cruciform DNA structures 
could form at gene boundaries and be stabilized by Hmol. Gene looping is 
described inthe blue area. b, Schematic representation of gene loop structures 
in S phase. Top2 associates with gene boundaries to harmonize topological 
transactions when transcribed genes are approached by incoming replication 


forks. We note that the topological dynamics described in the twin topological 
domain model’ are not represented in our scheme. c, Inthe absence of Hmol, 
negative supercoil would lose the cruciform conformation and becomea 
substrate for Top2. Top2 defects would delocalize Top] at gene boundaries. 
The cruciform structures stabilized by Hmol would then become substrates 
for unscheduled Top1 activity that would convert them into aberrant 
intermediates such as single-stranded DNA, nicks and knots. Inhmoltop2 
double mutants, Top1is not recruited at the gene boundaries, which remainina 
negative supercoil context. 
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Extended Data Fig. 8 | bTMP titration. a, b, By using a biotin-labelled oligoasa 
reference point, we titrated in vivo binding of bTMP in yeast at different 
concentrations (from 0 to 800 ng). We diluted 4.16 pmol of biotin- 

labelled oligos into 200 pl of TE. The shown volumes of oligos were spotted 
ontoahybond membrane after equilibration with 1x maleic acid. Genomic DNA 
was isolated after UV (365 nM) cross-linking with the respective amount of 
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bTMP and about 300 ng (a) or 1,500 ng (b) was spotted on the blot. The last 
sample was kept as a negative control for UV cross-linking. In Panel B, fivefold 
more oligo and genomic DNA was spotted compared to Panel A. Dot blot was 
developed with ExtrAvidin-Peroxidase antibody (Sigma-Aldrich, cat no: 
E2886). b, By measuring dot blot intensities, we estimated that 400 pg of bTMP 
was needed for 2 x 10” yeast cells. 
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Extended Data Fig. 9 | bTMP binding normalization and dispersion profile 
for bTMP. a, To avoid bias, normalization was performed to filter potential 
sequence-specific psoralen DNA binding”’. To normalize the data, we first 
purified and sonicated genomic DNA and then performed the bTMP-ChIP 
procedure on purified DNA fragments. The correction for microarray readings 
was done by subtracting bTMP binding in vivo from bTMP binding on purified/ 
sonicated genomic DNAas follows: (bTMP cells —IP/input) — (bTMP purified 
DNA - IP/input), which gives the normalized ratio of bTMP (bTMP - IP/input) 


binding. The meta ORF plot shows bTMP profiles with or without control DNA 
normalization of bTMP in wild-type S phase cells. b,c, Meta-gene plot showing 
the normalized mean bTMP ratio (b) and median bTMP (c) ratio in wild-typeS 
phase cells. For both the plots, bTMP binding ratios for all protein-coding genes 
were plotted without smoothing, along with upper and lower confidence 
intervals (a=0.05 or 95% limit). Dotted lines represent upper and lower 
confidence intervals (a= 0.05 or 95% limit). The confidence interval does not 
deviate significantly from the mean and median values. 
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Extended Data Table 1| List of S. cerevisiae strains 


Strain Stock Number 
1 Wt SY2080 
2 top2-1 CY8423 
3 hmo1A CY8476 
4 top2-1hm01A CY8475 
5 top1A cy9950 
6 Top1-6XHis-10xFlag CY7178 
7 top2-1,Top1-6XHis-10xFlag cy7411 
8 top2-1top1A cY10344 
9 hmo1A,Top1-6XHis-10xFlag cy15215 
A a -6XHis- 
10 hmo1Atop2-1,Top1-6XHis-10x CY15216 
Flag 
11 Rpb3-10X-Flag cy15214 
12 Wt [control] cY15421 
13 Wt [TopA] cy15422 
14 top2-1top1A [control] cY15423 
: A 3 
16 top2-1top14,Hmo1-10x Flag, [ CY15427 
Control] 
5 A i 
v7 top2-1top14,Hmo1-10x Flag, [ cY15428 


TopA] 


Genotype 


Mata, ade2-1, ura3-1, trp1-1, leu2-3, leu2-112, his3-11, his3- 


15, can1-100, GAL, PSI+, RAD5+ 

MATa ADE2+ CAN1+, ura3-1, his3-11,15 leu2-3, 12 trp1-1, 
RAD5+, top2-1 

MATa, ADE2+ CAN1+, ura3-1, his3-11,15 leu2-3, 112 trp1- 
1, RAD5+ ,hmo1::HIS 

MATa, ADE2+ CAN1+, ura3-1, his3-11,15 leu2-3, 112 trp1- 
1, RADS+, hmo1::HIS, top2-1 

MATa ADE2+ CAN1+, ura3-1, his3-11,15 leu2-3, 12 trp1-1, 
RAD5+, top1::HIS 


Mata, ade2-1, ura3-1, trp1-1, leu2-3, leu2-112, his3-11, his3- 


15, can1-100, GAL, PSI+, RAD5+, ura3::URA3/GPD-TK(7X), 
top1-6His10Flag (KANr) 


Mata, ade2-1, ura3-1, trp1-1, leu2-3, leu2-112, his3-11, his3- 


15, can1-100, GAL, PSI+, RAD5+, ura3::URA3/GPD-TK(7X), 
top1-6His10Flag (KANr), top2-1 


MATa ADE2+ CAN1+, ura3-1, his3-11,15 leu2-3, 12 trp1-1, 
RADS5+, top2-1, top1::HIS 


Mata, ade2-1, ura3-1, trp1-1, leu2-3, leu2-112, his3-11, his3- 


15, can1-100, GAL, PSI+, RAD5+, hmo1::Hygromycin, top1- 
6His10Flag (KANr) 


Mata, ade2-1, ura3-1, trp1-1, leu2-3, leu2-112, his3-11, his3- 


15, can1-100, GAL, PSI+, RAD5+, top1-6His10Flag (KANr), 
top2-1, hmo1::Hygromycin 


Mata, ade2-1, ura3-1, trp1-1, leu2-3, leu2-112, his3-11, his3- 
15, can1-100, GAL, PSI+, RAD5+, HIS3::BrdU-Inc, rpb3::RPB3- 


10X Flag-KanMxX6 


Mata, ade2-1, ura3-1, trp1-1, leu2-3, leu2-112, his3-11, his3- 


15, can1-100, GAL, PSI+, RAD5+ [pYEp13-LEU empty] 


Mata, ade2-1, ura3-1, trp1-1, leu2-3, leu2-112, his3-11, his3- 


15, can1-100, GAL, PSI+, RADS5+ [pJRW13-YEptopA-pGPD- 
LEU] 


MATa ADE2+ CAN1+, ura3-1, his3-11,15 leu2-3, 12 trp1-1, 
RAD5+, top2-1, top1::HIS [pYEp13-LEU empty] 


MATa ADE2+ CAN1+, ura3-1, his3-11,15 leu2-3, 12 trp1-1, 
RAD5+, top2-1, top1::HIS [pJRW13-YEptopA-pGPD-LEU] 
MATa ADE2+ CAN1+, ura3-1, his3-11,15 leu2-3, 12 trp1-1, 
RADS+, top2-1, top1::HIS, HMO1::HMO1-10X Flag [pYEp13- 
LEU empty] 


MATa ADE2+ CAN1+, ura3-1, his3-11,15 leu2-3, 12 trp1-1, 
RADS+, top2-1, top1::HIS, HMO1::HMO1-10X Flag [pJRW13- 
YEptopA-pGPD-LEU] 
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For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 
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n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
LJ AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection No software was used 


Data analysis rMAT (Droit A, 2010) (v3.33.0) 

STAR aligner (Dobin A, 2013) 
RSEM (Dewey, 2011) (v1.2.31) 
SAMTOOLS (v 1.9) 
DEEPTOOLS (Ramirez et al., 2014) (v3.2.0) 
DANPOS toolkit (Chen et al., 2013) (v2.2.2) 
BEDTOOLS (v2.29.0) 
FASTX Toolkit: http://hannonlab.cshl.edu/fastx_toolkit/ (v0.0.14) 
TMAP Toolkit: https://github.com/iontorrent/TMAP (v3.4.0) 
PICARD Toolkit: https://broadinstitute.github.io/picard/ (v2.14.1) 
ACS2 Toolkit (v2.1.2) 
BWA Toolkit: http://bio-bwa.sourceforge.net/ (vO.7.17-r1188) 
ggplot2 R package (v 3.2.1) 


All the custom-made scripts used for this study are made available in the GitHub repository 
https://github.com/adhilmd/TopologyCustomAnalysis 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


NCBI GEO accession ID: GSE114410 (This study) Contains bTMP, RNA:DNA hybrids, Protein ChIP-chip RAW & processed data 
NCBI GEO accession ID: GSE114444 (This study) Contains RNA-seq, H3 ChIP-seq RAW, ChIA-PET RAW & processed data 
NCBI GEO accession ID: GSE16258 (Bermejo et al., 2009) Contains Top2, Hmo1, RPB3 protein ChIP-chip data 
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Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size 6706 RNA pol Il transcribed genes 
173 replication origins 


Data exclusions No data excluded 
Replication All experiments were performed with biological replicates 
Randomization — Not relevant to this study, as samples were yeast strain specific 


Blinding Not relevant to this study, as samples were yeast strain specific 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 
Antibodies 
Antibodies used anti-Flag (M2-antiflag) Antibody (Sigma-Aldrich cat no F3165, clone: M2, Lot#: SLBQ6349V) 
S9.6 Monoclonal RNA:DNA Antibody (Boguslawski et al.,1986) 
anti-Hstone H3 antibody (Abcam cat no 1791, Lot#:GR3236305-1) x 
ExtrAvidin-Peroxidase antibody (Sigma-Aldrich cat no: E2886, Lot#:015M4844V, For dot blot 1:4000 dilution is used ) g 
9 
® 
Validation S9.6 Monoclonal RNA:DNA Antibody (Boguslawski et al.,1986) is validated and used in several publications including: El Hage, A., N 


0 


et al. (2010), Chan, Y. A., et al. (2014), Wahba, L., et al. (2016), Vanoosthuyse, V. (2018), El Hage, A. and D. Tollervey (2018) 


ChIP-seq 


Data deposition 


Confirm that both raw and final processed data have been deposited in a public database such as GEO. 


Confirm that you have deposited or provided access to graph files (e.g. BED files) for the called peaks. 


Data access links NCBI GEO accession ID: GSE114444 (This study) Contains H3 ChIP-seq, Hmoltag protein ChIP-seq, ChIA-PET RAW & 
May remain private before publication. processed data 
Files in database submission GSM3664900 ChIP-seq: WT-Controlplasmid-nucleosome-Input 


GSM3664901 ChIP-seq: WT-Controlplasmid-nucleosome-I|P 

GSM3664902 ChIP-seq: Top2-1-Top1-Controlplasmid-nucleosome-Input 
GSM3664903 ChIP-seq: Top2-1-Top1-Controlplasmid-nucleosome-IP 
GSM3664904 ChIP-seq: WT-TopAplasmid-nucleosome-Input 

GSM3664905 ChIP-seq: WT-TopAplasmid-nucleosome-I|P 

GSM3664906 ChIP-seq: Top2-1-Top1-TopAplasmid-nucleosome-Input 
GSM3664907 ChIP-seq: Top2-1-Top1-TopAplasmid-nucleosome-IP 
GSM3664908 ChIP-seq: Hmo1tag-Top2-1-Top1-Controlplasmid-protein-Input 
GSM3664909 ChIP-seq: Hmo1tag-Top2-1-Top1-Controlplasmid-protein-IP 
GSM3664910 ChIP-seq: Hmo1tag-Top2-1-Top1-TopAplasmid-protein-Input 
GSM3664911 ChIP-seq: Hmo1tag-Top2-1-Top1-TopAplasmid-protein-|IP 
GSM4094803 ChIA-PET: Top2-ChIA-PET 
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Genome browser session i) H3 ChIP-seq (Nucleosome): 
(e.g. UCSC) http://epigenomegateway.wustl.edu/legacy/?genome=sacCer3&session=2IU7XHKkwi&statusld=1279418941 

Note: Use the Nucleosome_TopA Session: First and second track is WT-control plasmid (bed density and bedgraph), third 
and fourth track is WT-TopA plasmid (bed density and bedgraph), fifth and sixth track is Top2-1-Top1-Control plasmid (bed 


density and bedgraph), seventh and eighth track is Top2-1-Top1-TopA plasmid (bed density and bedgraph) 


ii) Protein ChIP-seq: 
http://epigenomegateway.wustl.edu/legacy/?genome=sacCer3 &session=4FQoSLC1lk&statusld=1043438118 

Note: Use the TopA_Hmo1 Session: First track is Hmo1tag-Top2-1-Top1-Controlplasmid and Second track is Hmoltag- 
Top2-1-Top1-TopAplasmid) 


iii) ChIA-PET: 

http://epigenomegateway.wustl.edu/legacy/?genome=sacCer3 &session=arHkgZiCZP&statusl|d=302725081 

Note: Use the ChIA-PET-Data Session: First track is Hmo1 protein binding regions, Second track is Top2 protein binding 
regions, Third track is ChIA-PET data for Top2-10X Flag and Fourth track is the interaction points for ChIA-PET Top2-10X Flag) 


o longer applicable for final submission. 


Methodology 


Replicates One biological replicate for each sample is generated and analyzed for consistency. (Replicates were not yet deposited in the 
GEO) 
Sequencing depth i) H3 ChIP-seq (Nucleosome): Average Reads: ~15 Million Reads, Average Read length: ~180bp, Average Mapped Reads: ~8 


illion Reads and single end reads (lon Torrent Platform) 
ii) Protein ChIP-seq: Average Reads: ~15 Million Reads, Average Read length: ~180bp, Average Mapped Reads: ~8 Million 
Reads and single end reads (lon Torrent Platform) 


Antibodies i) H3 ChIP-seq (Nucleosome): anti-Hstone H3 antibody (Abcam cat no 1791) 
ii) Protein ChIP-seq: anti-Flag (M2-antiflag) Antibody (Sigma-Aldrich cat no F3165) 
iii) ChIA-PET: anti-Flag (M2-antiflag) Antibody (Sigma-Aldrich cat no F3165) 


Peak calling parameters i) H3 ChIP-seq: The raw reads were filtered based on the quality value (-q 20 and —p 30) using FASTX Toolkit. The filtered 
reads were aligned to the reference genome (SacCer 2011) using TMAP aligner. The PCR duplicates were removed from the 
aligned BAM files using PICARD tools. The BAM files were sorted and indexed for the peak calling using SAMTOOLS. The 
bedgraph files were generated by comparing bam files of IP and Input (IP read coverage/Input read coverage) result in the 
ratio for every base across the whole genome using DEEPTOOLS (bamCompare) (Ramirez et al., 2014). Finally, peak calling 
was performed using DANPOS (dpos) toolkit (Chen et al., 2013) with the IP/Input threshold 1.4 (-q 1.4) where the output 
peaks corresponds to the individual nucleosome. The DANPOS was preferred over MACS toolkit for the dynamic nucleosome 
analysis at single nucleotide resolution. 
(ii) Protein ChIP-seq: The raw reads were filtered based on the quality value (-q 20 and —p 30) using FASTX Toolkit. The 
filtered reads were aligned to the reference genome (SacCer 2011) using TMAP aligner. The PCR duplicates were removed 
from the aligned BAM files using PICARD tools. Finally, MACS2 tool is used for peak calling with the following parameters (-f 
BAM --gsize=1.21e+7 -n ctrl-A -B -p 0.01 --nomodel --extsize 200 --broad). 


ow 


Data quality The peak fold enrichment were in the range of 1.5 to 3.0 


Software 


SAMTOOLS (v 1.9) 

DEEPTOOLS (Ramirez et al., 2014) (v3.2.0) 

DANPOS toolkit (Chen et al., 2013) (v2.2.2) 

BEDTOOLS (v2.29.0) 

FASTX Toolkit: http://hannonlab.cshl.edu/fastx_toolkit/ (v0.0.14) 
TMAP Toolkit: https://github.com/iontorrent/TMAP (v3.4.0) 
PICARD Toolkit: https://broadinstitute.github.io/picard/ (v2.14.1) 
MACS2 Toolkit (v2.1.2) 

BWA Toolkit: http://bio-bwa.sourceforge.net/ (v0O.7.17-r1188) 
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Protein structure prediction can be used to determine the three-dimensional shape of 
a protein from its amino acid sequence’. This problem is of fundamental importance 
as the structure of a protein largely determines its function’; however, protein 
structures can be difficult to determine experimentally. Considerable progress has 
recently been made by leveraging genetic information. It is possible to infer which 
amino acid residues are in contact by analysing covariation in homologous 
sequences, which aids in the prediction of protein structures*. Here we show that we 
can train a neural network to make accurate predictions of the distances between 
pairs of residues, which convey more information about the structure than contact 
predictions. Using this information, we construct a potential of mean force’ that can 
accurately describe the shape of a protein. We find that the resulting potential can be 
optimized by a simple gradient descent algorithm to generate structures without 
complex sampling procedures. The resulting system, named AlphaFold, achieves high 
accuracy, even for sequences with fewer homologous sequences. In the recent Critical 
Assessment of Protein Structure Prediction’ (CASP13)—a blind assessment of the state 


of the field—AlphaFold created high-accuracy structures (with template modelling 
(TM) scores’ of 0.7 or higher) for 24 out of 43 free modelling domains, whereas the 
next best method, which used sampling and contact information, achieved such 
accuracy for only 14 out of 43 domains. AlphaFold represents a considerable advance 
in protein-structure prediction. We expect this increased accuracy to enable insights 
into the function and malfunction of proteins, especially in cases for which no 
structures for homologous proteins have been experimentally determined’. 


Proteins are at the core of most biological processes. As the function of 
a protein is dependent on its structure, understanding protein struc- 
tures has been a grand challenge in biology for decades. Although 
several experimental structure determination techniques have been 
developed and improved in accuracy, they remain difficult and time- 
consuming”. As aresult, decades of theoretical work has attempted to 
predict protein structures from amino acid sequences. 

CASP?* is a biennial blind protein structure prediction assessment 
run by the structure prediction community to benchmark progress in 
accuracy. In 2018, AlphaFold joined 97 groups from around the worldin 
entering CASP13°. Each group submitted up to 5 structure predictions 
for each of 84 protein sequences for which experimentally determined 
structures were sequestered. Assessors divided the proteins into 104 
domains for scoring and classified each as being amenable to template- 
based modelling (TBM, in which a protein with a similar sequence has 
a known structure, and that homologous structure is modified in 
accordance with the sequence differences) or requiring free model- 
ling (FM, in cases in which no homologous structure is available), with 


an intermediate (FM/TBM) category. Figure 1a shows that AlphaFold 
predicts more FM domains with high accuracy than any other system, 
particularly in the 0.6-0.7 TM-score range. The TM score—ranging 
between O and 1—measures the degree of match of the overall (back- 
bone) shape ofa proposed structure to a native structure. The assessors 
ranked the 98 participating groups by the summed, capped z-scores of 
the structures, separated according to category. AlphaFold achieved 
asummed z-score of 52.8 in the FM category (best-of-five) compared 
with 36.6 for the next closest group (322). Combining FM and TBM/FM 
categories, AlphaFold scored 68.3 compared with 48.2. AlphaFold is 
able to predict previously unknown folds to high accuracy (Fig. 1b). 
Despite using only FM techniques and not using templates, AlphaFold 
also scored well in the TBM category according to the assessors’ for- 
mula O-capped z-score, ranking fourth for the top-one model or first 
for the best-of-five models. Much of the accuracy of AlphaFold is due 
to the accuracy of the distance predictions, which is evident from the 
high precision of the corresponding contact predictions (Fig. 1c and 
Extended Data Fig. 2a). 
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Fig. 1| The performance of AlphaFold in the CASP13 assessment. a, Number 
of FM (FM+FM/TBM) domains predicted for a given TM-score threshold for 
AlphaFold and the other 97 groups. b, For the six new folds identified by the 
CASP13 assessors, the TMscore of AlphaFold was compared with the other 
groups, together with the native structures. The structure of T1017s2-D1is not 
available for publication. c, Precisions for long-range contact predictionin 


The most-successful FM approaches thus far? “ have relied on frag- 
ment assembly. In these approaches, a structure is created through 
a stochastic sampling process—such as simulated annealing”—that 
minimizes a statistical potential that is derived from summary statistics 
extracted from structures in the Protein Data Bank (PDB). In fragment 
assembly, a structure hypothesis is repeatedly modified, typically by 
changing the shape of a short section while retaining changes that lower 
the potential, ultimately leading to low potential structures. Simu- 
lated annealing requires many thousands of such moves and must be 
repeated many times to have good coverage of low-potential structures. 

Inrecent years, the accuracy of structure predictions has improved 
through the use of evolutionary covariation data“ that are found in sets 
of related sequences. Sequences that are similar to the target sequence 
are found by searching large datasets of protein sequences derived 
from DNA sequencing and aligned to the target sequence to generate 
amultiple sequence alignment (MSA). Correlated changes in the posi- 
tions of two amino acid residues across the sequences of the MSA can be 
used to infer which residues might be in contact. Contacts are typically 
defined to occur when the B-carbon atoms of 2 residues are within 8 A 
of one another. Several methods’, including neural networks” ~*, 
have been used to predict the probability that a pair of residues is in 
contact based on features computed from MSAs. Contact predictions 
are incorporated in structure predictions by modifying the statistical 
potential to guide the folding process to structures that satisfy more 
of the predicted contacts"”’. Other studies“ have used predictions 
of the distance between residues, particularly for distance geometry 
approaches”, Neural network distance predictions without covari- 
ation features were used to make the evolutionary pairwise distance- 
dependent statistical potential”, which was used to rank structure 
hypotheses. In addition, the QUARK pipeline” used a template-based 
distance-profile restraint for TBM. 

Inthis study, we present a deep-learning approach to protein struc- 
ture prediction, the stages of which are illustrated in Fig. 2a. We show 
that it is possible to construct a learned, protein-specific potential 
by training a neural network (Fig. 2b) to make accurate predictions 
about the structure of the protein given its sequence, and to predict 
the structure itselfaccurately by minimizing the potential by gradient 
descent (Fig. 2c). The neural network predictions include backbone 
torsion angles and pairwise distances between residues. Distance 
predictions provide more specific information about the structure 
than contact predictions and provide a richer training signal for the 
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CASP13 for the most probable, L/2 or L/S contacts, where Z is the length of the 
domain. The distance distributions used by AlphaFold in CASP13, thresholded 
to contact predictions, are compared with the submissions by the two best- 
ranked contact prediction methods in CASP13: 498 (RaptorX-Contact”*) and 
032 (TripletRes*”) on ‘all groups’ targets, with updated domain definitions for 
T0953s2. 


neural network. By jointly predicting many distances, the network 
can propagate distance information that respects covariation, local 
structure and residue identities of nearby residues. The predicted 
probability distributions can be combined to formasimple, principled 
protein-specific potential. We show that with gradient descent, it is 
simple to find a set of torsion angles that minimizes this protein-specific 
potential using only limited sampling. We also show that whole chains 
can be optimized simultaneously, avoiding the need to segment long 
proteins into hypothesized domains that are modelled independently 
as is common practice (see Methods). 

The central component of AlphaFold is a convolutional neural 
network that is trained on PDB structures to predict the distances 
d, between the C, atoms of pairs, ij, of residues of a protein. On the 
basis of a representation of the amino acid sequence, S, of a protein 
and features derived from the MSA(S) of that sequence, the network, 
whichis similar in structure to those used for image-recognition tasks”, 
predicts a discrete probability distribution P(d,|S, MSA(S)) for every 
ij pair in any 64 x 64 region of the L x L distance matrix, as shown in 
Fig. 2b. The full set of distance distribution predictions constructed 
by combining such predictions that covers the entire distance map is 
termed a distogram (from distance histogram). Example distogram 
predictions for one CASP protein, T0955, are shown in Fig. 3c, d. The 
modes of the distribution (Fig. 3c) can be seen to closely match the 
true distances (Fig. 3b). Example distributions for all distances to one 
residue (residue 29) are shown in Fig. 3d. We found that the predictions 
of the distance correlate well with the true distance between residues 
(Fig. 3e). Furthermore, the network also models the uncertainty in its 
predictions (Fig. 3f). When thes.d. of the predicted distribution is low, 
the predictions are more accurate. This is also evident in Fig. 3d, in 
which more confident predictions of the distance distribution (higher 
peak and lowers.d. of the distribution) tend to be more accurate, with 
the true distance close to the peak. Broader, less-confidently predicted 
distributions still assign probability to the correct value even when it 
is not close to the peak. The high accuracy of the distance predictions 
and consequently the contact predictions (Fig. 1c) comes from acom- 
bination of factors in the design of the neural network and its training, 
data augmentation, feature representation, auxiliary losses, cropping 
and data curation (see Methods). 

To generate structures that conform to the distance predictions, 
we constructed a smooth potential Voistance by fitting a spline to the 
negative log probabilities, and summing across all of the residue pairs 
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Fig. 2| The folding process illustrated for CASP13 target TO986s2. CASP 
target T0986s2, =155, PDB: 6N9V.a, Steps of structure prediction. b, The 
neural network predicts the entire Z x L distogram based on MSA features, 
accumulating separate predictions for 64 x 64-residue regions. c, One iteration 
of gradient descent (1,200 steps) is shown, with the TM score and root mean 
square deviation (r.m.s.d.) plotted against step number with five snapshots of 
the structure. The secondary structure (from SST**) is also shown (helix in blue, 
strand in red) along with the native secondary structure (Nat.), the secondary 


(see Methods). We parameterized protein structures by the backbone 
torsion angles (@, f) of all residues and build a differentiable model of 
protein geometry x = G(g, p) to compute the C, coordinates, x; for all 
residues i and thus the inter-residue distances, d; = ||x;— x;||, for each 
structure, and express Viicrance aS a function of g and w. For a protein with 
Lresidues, this potential accumulates L? terms from marginal distribu- 
tion predictions. To correct for the overrepresentation of the prior, we 
subtract a reference distribution” from the distance potential in the log 
domain. The reference distribution models the distance distributions 
P(d,|length) independent of the protein sequence and is computed 
by training a small version of the distance prediction neural network 
on the same structures, without sequence or MSA input features. 
Aseparate output head of the contact prediction network is trained to 
predict discrete probability distributions of backbone torsion angles 
P(@;,)|S, MSA(S)). After fitting a von Mises distribution, this is used to 
add asmooth torsion modelling term, Vo,sion, to the potential. Finally, 
to prevent steric clashes, we add the V,core2 smooth SCOre Of Rosetta’ to the 
potential, as this incorporates a van der Waals term. We used multipli- 
cative weights for each of the three terms in the potential; however, no 
combination of weights noticeably outperformed equal weighting. 

As all of the terms in the combined potential V,,..(@, W) are 
differentiable functions of (@, w), it can be optimized with respect to 
these variables by gradient descent. Here we use L-BFGS”. Structures 
are initialized by sampling torsion values from P(@;, W|S, MSA(S)). 
Figure 2c illustrates a single gradient descent trajectory that minimizes 
the potential, showing how this greedy optimization process leads to 
increasing accuracy and large-scale conformation changes. The sec- 
ondary structure is partly set by the initialization from the predicted 
torsion angle distributions. The overall accuracy (TM score) improves 
quickly and after a few hundred steps of gradient descent the accuracy 
of the structure has converged to a local optimum of the potential. 
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structure prediction probabilities of the network and the uncertainty in 
torsion angle predictions (as x’ of the von Mises distributions fitted to the 
predictions for gand yw). While each step of gradient descent greedily lowers 
the potential, large global conformation changes are effected, resulting ina 
well-packed chain. d, The final first submission overlaid onthe native structure 
(in grey). e, The average (across the test set, n=377) TMscore of the lowest- 
potential structure against the number of repeats of gradient descent per 
target (log scale). 


We repeated the optimization from sampled initializations, 
leading to a pool of low-potential structures from which further struc- 
ture initializations are sampled, with added backbone torsion noise 
(‘noisy restarts’), leading to more structures to be added to the pool. 
After only a few hundred cycles, the optimization converges and the 
lowest potential structure is chosen as the best candidate structure. 
Figure 2e shows the progress in the accuracy of the best-scoring struc- 
tures over multiple restarts of the gradient descent process, show- 
ing that after a few iterations the optimization has converged. Noisy 
restarts enable structures witha slightly higher TM score to be found 
than when continuing to sample from the predicted torsion distribu- 
tions (average of 0.641 versus 0.636 on our test set, shown in Extended 
Data Fig. 4). 

Figure 4a shows that the distogram accuracy (measured using the 
local distance difference test (IDDT,,) of the distogram; see Meth- 
ods) correlates well with the TM score of the final realized structures. 
Figure 4b shows the effect of changing the construction of the potential. 
Removing the distance potential entirely gives a TM score of 0.266. 
Reducing the resolution of the distogram representation below six bins 
by averaging adjacent bins causes the TM score to degrade. Removing 
the torsion potential, reference correction OF Vécorez smooth degrades the 
accuracy only slightly. A final ‘relaxation’ (side-chain packing inter- 
leaved with gradient descent) with Rosetta’, using a combination of 
the Talaris2014 potential and a spline fit of our reference-corrected 
distance potential adds side-chain atom coordinates, and yields a small 
average improvement of 0.007 TM score. 

We show that a carefully designed deep-learning system can pro- 
vide accurate predictions of inter-residue distances and can be used 
to construct a protein-specific potential that represents the protein 
structure. Furthermore, we show that this potential can be optimized 
with gradient descent to achieve accurate structure predictions. 
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Fig. 3 | Predicted distance distributions compared with true distances. 

a-d, CASP target T0955, / = 41, PDB SW9F. a, Native structure showing 

distances under 8A fromthe C, of residue 29. b, c, Native inter-residue 

distances (b) and the mode of the distance predictions (c), highlighting residue 
29.d, The predicted probability distributions for distances of residue 29 to all 
other residues. The bin corresponding to the native distance is highlighted in 
red, 8 Ais drawnin black. The distributions of the true contacts are plotted in 
green, non-contacts in blue. e, f, CASP target T0990, L = 552, PDB 6N9OV. 


Whereas FM predictions only rarely approach the accuracy of experi- 
mental structures, the CASP13 assessment shows that the AlphaFold 
system achieves unprecedented FM accuracy and that this FM method 
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Fig. 4 | TMscores versus the accuracy of the distogram, and the dependency 
ofthe TMscore on different components of the potential. a, TM score versus 
distogram IDDT,, with Pearson’s correlation coefficients, for both CASP13 

(n=500:5 decoys for all domains, excluding T0999) and test (n = 377) datasets. 
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0.25 Abins. The true distance matrix and distogram for T0990 are shownin 
Extended Data Fig. 2b, c. 


can match the performance of template-modelling approaches without 
using templates and is starting to reach the accuracy needed to provide 
biological insights (see Methods). We hope that the methods we have 
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described can be developed further and applied to benefit all areas 
of protein science with more accurate predictions for sequences of 
unknown structure. 
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Methods 


Extended Data Figure 1a shows the steps involved in MSA construction, 
feature extraction, distance prediction, potential construction and 
structure realization. 


Tools 

The following tools and dataset versions were used for the CASP sys- 
tem and for subsequent experiments: PDB 15 March 2018; CATH 16 
March 2018; HHblits based on v.3.0-beta.3 (three iterations, F=1* 10°); 
HHpred web server; Uniclust30 2017-10; PSI-BLAST v.2.6.0 nr dataset 
(as of 15 December 2017) (three iterations, F=1* 10°); SST web server 
(March 2019); BioPython v.1.65; Rosetta v.3.5; PyMol 2.2.0 for structure 
visualization; TM-align 20160521. 


Data 

Our models are trained on structures extracted from the PDB”. 
We extract non-redundant domains by utilizing the CATH** 35% 
sequence similarity cluster representatives. This generated 31,247 
domains, which were split into train and test sets (29,427 and 1,820 
proteins, respectively), keeping all domains from the same homologous 
superfamily (H-level in the CATH classification) in the same partition. 
The CATH superfamilies of FM domains from CASP11 and 
CASP12 were also excluded from the training set. From the test set, we 
took—at random—a single domain per homologous superfamily to 
create the 377 domain subset used for the results presented here. We 
note that accuracies for this set are higher than for the CASP13 test 
domains. 

CASP13 submission results are drawn from the CASP13 results pages 
with additional results shown for the CASP13 dataset for ‘all groups’ 
chains, scored on CASP13 PDB files, by CASP domain definitions. Con- 
tact prediction accuracies were recomputed from the group 032 and 
498 submissions (as RR files), compared with the distogram predictions 
used by AlphaFold for CASP13 submissions. Contact prediction prob- 
abilities were obtained from the distograms by summing the probability 
mass in each distribution below 8 A. 

For each training sequence, we searched for and aligned to the train- 
ing sequence similar protein sequences in the Uniclust30* dataset 
with HHblits* and used the returned MSA to generate profile features 
with the position-specific substitution probabilities for each residue 
as well as covariation features—the parameters of a regularized pseu- 
dolikelihood-trained Potts model similar to CCMpred’®. CCMPred uses 
the Frobenius norm of the parameters, but we feed both this norm 
(1 feature) and the raw parameters (484 features) into the network for 
each residue pair ij. In addition, we provide the network with features 
that explicitly represent gaps and deletions in the MSA. To make the 
network better able to make predictions for shallow MSAs, and as a 
form of data augmentation, we take a sample of half the sequences 
from the the HHblits MSA before computing the MSA-based features. 
Our training set contains 10 such samples for each domain. We extract 
additional profile features using PSI-BLAST”. 

The distance prediction neural network was trained with the follow- 
ing input features (with the number of features indicated in brackets). 

¢ Number of HHblits alignments (scalar). 

¢ Sequence-length features: 1-hot amino acid type (21 features); 

profiles: PSI-BLAST (21 features), HHblits profile (22 features), 
non-gapped profile (21 features), HHblits bias, HMM profile (30 
features), Potts model bias (22 features); deletion probability (1 fea- 
ture); residue index (integer index of residue number, consecutive 
except for multi-segment domains, encoded as 5 least-significant 
bits andascalar). 

¢ Sequence-length-squared features: Potts model parameters 

(484 features, fitted with 500 iterations of gradient descent using 
Nesterov momentum 0.99, without sequence reweighting); 
Frobenius norm (1 feature); gap matrix (1 feature). 


The z-scores were taken from the results CASP13 assessors (http:// 
predictioncenter.org/casp13/zscores _final.cgi?formula=assessors). 


Distogram prediction. The inter-residue distances are predicted by 
a deep neural network. The architecture is a deep two-dimensional 
dilated convolutional residual network. Previously, atwo-dimensional 
residual network was used that was preceded by one-dimensional em- 
bedding layers for contact prediction”. Our network is two-dimensional 
throughout and uses 220 residual blocks” with dilated convolutions*’. 
Each residual block, illustrated in Extended Data Fig. 1b, consists of a 
sequence of neural network layers” that interleave three batchnorm 
layers; two 1x 1 projection layers; a3 x 3 dilated convolution layer and 
exponential linear unit (ELU)*° nonlinearities. Successive layers cycle 
through dilations of 1, 2, 4, 8 pixels to allow propagation of informa- 
tion quickly across the cropped region. For the final layer, a position- 
specific bias was used, such that the biases were indexed by residue- 
offset (capped at 32) and bin number. 

The network is trained with stochastic gradient descent using a 
cross-entropy loss. The target is a quantification of the distance 
between the C, atoms of the residues (or C, for glycine). We divide 
the range 2-22 A into 64 equal bins. The input to the network consists 
of atwo-dimensional array of features in which each i,j feature is the 
concatenation of the one-dimensional features for both iand/as well 
as the two-dimensional features for i,j. 

Individual training runs were cross-validated with early stopping 
using 27 CASP11 FM domains as a validation set. Models were selected 
by cross-validation on 27 CASP12 FM domains. 


Neural network hyperparameters 

-7 groups of 4 blocks with 256 channels, cycling through dilations 
1,2,4,8. 

- 48 groups of 4 blocks with 128 channels, cycling through dilations 
1,2,4,8. 

* Optimization: synchronized stochastic gradient descent 

+ Batch size: batch of 4 crops on each of 8 GPU workers. 

* 0.85 dropout keep probability. 

¢ Nonlinearity: ELU. 

* Learning rate: 0.06. 

¢ Auxiliary loss weights: secondary structure: 0.005; accessible sur- 
face area: 0.001. These auxiliary losses were cut by a factor 10 after 
100 000 steps. 

* Learning rate decayed by 50% at 150,000, 200,000, 250,000 and 
350,000 steps. 

¢ Training time: about 5 days for 600,000 steps. 


Cropped distograms. To constrain memory usage and avoid overfit- 
ting, the network was always trained and tested on 64 x 64 regions 
of the distance matrix, that is, the pairwise distances between 64 
consecutive residues and another group of 64 consecutive residues. 
For each training domain, the entire distance matrix was split into 
non-overlapping 64 x 64 crops. By training off-diagonal crops, the 
interaction between residues that are further apart than 64 residues 
could be modelled. Each crop consisted of the distance matrix that 
represented the juxtaposition of two 64-residue fragments. It has 
previously been shown” that contact prediction needs only a limited 
context window. We note that the distance predictions close to the 
diagonal i=j, encode predictions of the local structure of the protein, 
and for any cropped region the distances are governed by the local 
structure of the two fragments represented by the iand/ ranges of the 
crop. Augmenting the inputs with the on-diagonal two-dimensional 
input features that correspond to both the i andj ranges provides 
additional information to predict the structure of each fragment and 
thus the distances between them. It can be seen that if the fragment 
structures can be well predicted (for instance, if they are confidently 
predicted as helices or sheets), then the prediction ofa single contact 
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between the fragments will strongly constrain the distances between 
all other pairs. 

Randomizing the offset of the crops each time a domain is used in 
training leads to a form of data augmentation in which a single pro- 
tein can generate many thousands of different training examples. 
This is further enhanced by adding noise proportional to the ground- 
truth resolution to the atom coordinates, leading to variation in the 
target distances. Data augmentation (MSA subsampling and coordinate 
noise), together with dropout", prevents the network from overfitting 
to the training data. 

To predict the distance distribution for all L x L residue pairs, many 
64 x 64 crops are combined. To avoid edge effects, several such tilings 
are produced with different offsets and averaged together, with a heav- 
ier weighting for the predictions near the centre of the crop. To improve 
accuracy further, predictions from an ensemble of four separate 
models, trained independently with slightly different hyperparameters, 
are averaged together. Extended Data Figure 2b, c shows examples of 
the true distances and the mode of the distogram predictions fora 
three-domain CASP13 target, T0990. 

As the network has a rich representation capable of incorporat- 
ing both profile and covariation features of the MSA, we argue that 
the network can be used to predict the secondary structure directly. 
By mean- and max- pooling the two-dimensional activations of the 
penultimate layer of the network separately in bothiand/, we add an 
additional one-dimensional output head to the network that predicts 
eight-class secondary structure labels as computed by DSSP“ for each 
residue inj and i. The resulting accuracy of the Q3 (distinguishing the 
three helix/sheet/coil classes) predictions is 84%, which is comparable 
to the state-of-the-art predictions*. The relative accessible surface 
area (ASA) of each residue can also be predicted. 

The one-dimensional pooled activations are also used to predict the 
marginal Ramachandran distributions, P(@,, W;|S, MSA(S)), indepen- 
dently for each residue, as a discrete probability distribution approxi- 
mated to 10° (1,296 bins). In practice during CASP13 we used distograms 
from a network that was trained to predict distograms, secondary 
structure and ASA. Torsion predictions were taken from asecond similar 
network trained to predict distograms, secondary structure, ASA and 
torsions, as the former had been more thoroughly validated. 

Extended Data Figure 3b shows that an important factor in the accu- 
racy of the distograms (as has previously been found with contact 
prediction systems) is Nog, the effective number of sequences in the 
MSA”. This is the number of sequences found inthe MSA, discounting 
redundancy at the 62% sequence identity level, which we then divide by 
the number of residues in the target, and is an indication of the amount 
of covariation information in the MSA. 


Distance potential. The distogram probabilities are estimated for 
discrete distance bins; therefore, to construct a differentiable potential, 
the distribution is interpolated with a cubic spline. Because the final 
bin accumulates probability mass from all distances beyond 22 A, and 
as greater distances are harder to predict accurately, the potential was 
only fitted up to 18 A (determined by cross-validation), with a constant 
extrapolation thereafter. Extended Data Figure 3c (bottom) shows the 
effect of varying the resolution of the distance histograms on structure 
accuracy. 

To predict a reference distribution, a similar model is trained onthe 
same dataset. The reference distribution is not conditioned on the 
sequence, but to account for the atoms between which we are predict- 
ing distances, we do provide a binary feature 6,, to indicate whether 
the residue is a glycine (C, atom) or not (C,) and the overall length of 
the protein. 

A distance potential is created from the negative log likelihood of 
the distances, summed over all pairs of residues i, j (Supplementary 
equation (1)). With a reference state, this becomes the log-likelihood 


ratio of the distances under the full conditional model and under the 
background model (Supplementary equation (2)). 

Torsions are modelled as a negative log likelihood under the pre- 
dicted torsion distributions. As we have marginal distribution predic- 
tions, each of which can be multimodal, it can be difficult to jointly 
optimize the torsions. To unify all of the probability mass, at the cost 
of modelling fidelity of multimodal distributions, we fitted a unimodal 
von Mises distribution to the marginal predictions. This potential was 
summed over all residues i (Supplementary equation (3)). 

Finally, to prevent steric clashes, a van der Waals term was introduced 
through the use of Rosetta’s Vecore2 smooth: EXtended Data Figure 3c (top) 
shows the effect on the accuracy of the structure prediction of different 
terms in the potential. 


Structure realization by gradient descent. To realize structures that 
minimize the constructed potential, we created a differentiable model 
ofideal protein backbone geometry, giving backbone atom coordinates 
as a function of the torsion angles (@, W): x = G(@, W). The complete 
potential to be minimized is then the sum of the distance, torsion and 
score2_smooth (Supplementary equation (4)). Although there is no 
guarantee that these potentials have equivalent scale, scaling param- 
eters on the terms were introduced and chosen by cross-validation 
on CASP12 FM domains. In practice, equal weighting for all terms was 
found to lead to the best results. 

As every term in V,,,, is differentiable with respect to the torsion 
angles, given an initial set of torsions g, w, which can be sampled 
from the predicted torsion marginals, we can minimize V,,,,, using a 
gradient descent algorithm, such as L-BFGS*. The optimized struc- 
ture is dependent on the initial conditions, so we repeat the optimi- 
zation multiple times with different initializations. A pool of the 20 
lowest-potential structures is maintained and once full, we initialize 
90% of trajectories from those with 30° noise added to the backbone 
torsions (the remaining 10% still being sampled from the predicted 
torsion distributions). In CASP13, we obtained 5,000 optimization 
runs for each chain. Figure 2c shows the change in TM score against 
the number of restarts per protein. As longer chains take longer to 
optimize, this work load was balanced across (50 + L)/2 parallel work- 
ers. Extended Data Figure 4 shows similar curves against computation 
time, always comparing sampling starting torsions from the predicted 
marginal distributions with restarting from the pool of previous 
structures. 


Accuracy. We compare the final structures to the experimentally 
determined structures to measure their accuracy using metrics such 
as TM score, GDT_TS (global distance test, total score**) and r.m.s.d. 
Allofthese accuracy measures require geometric alignment between 
the candidate structure and the experimental structure. An alterna- 
tive accuracy measure that requires no alignment is the IDDT*, which 
measures the percentage of native pairwise distances D, under 15 A, 
with sequence offsets > rresidues, that are realized in a candidate struc- 
ture (as d,) within a tolerance of the true value, averaging across toler- 
ances of 0.5, 1,2 and 4 A (without stereochemical checks), as shownin 
Supplementary equation (5)). 

As the distogram predicts pairwise distances, we can introduce dis- 
togram IDDT (DLDDT), a measure similar to IDDT that is computed 
directly from the probabilities of the distograms, as shown in Sup- 
plementary equation (6)). As distances between residues nearby in 
the sequence are often short, easier to predict and are not critical in 
determining the overall fold topology, we set r=12, considering only 
those distances for residues with a sequence separation >12. Because 
we predict C, distances, for this study we computed both IDDT and 
DLDDT using the C, distances. Extended Data Figure 3a shows that 
DLDDT,, has high correlation (Pearson’s r= 0.92 for CASP13) with the 
IDDT,, of the realized structures. 


Full chains without domain segmentation. Parameterizing proteins 
of length L by two torsion angles per residue, the dimension of the 
space of structures grows as 2/; thus, searching for structures of large 
proteins becomes much more difficult. Traditionally this problem 
was addressed by splitting longer protein chains into pieces—termed 
domains—that fold independently. However, domain segmentation 
from the sequence alone is itself difficult and error-prone. For this 
study, we avoided domain segmentation and folded entire chains. 
Typically, MSAs are based ona given domain segmentation; however, 
we used a sliding window approach, computing a full-chain MSA to 
predict a baseline full-sequence distogram. We then computed MSAs 
for subsequences of the chain, trying windows of size 64, 128, 256 with 
offsets at multiples of 64. Each of these MSAs gave rise to an individual 
distogram that corresponded to an on-diagonal square of the full-chain 
distogram. We averaged all of these distograms together, weighted by 
the number of sequences in the MSA to produce an average full-chain 
distogram that is more accurate in regions in which many alignments 
can be found. For the CASP13 assessment, full chains were relaxed with 
Rosetta relax with a potential Of Veaiarisoo14 + 0.2 Veistance (Weighting deter- 
mined by cross-validation) and submissions from all of the systems 
were ranked based on this potential. 


CASP13 results. For CASP13, the five AlphaFold submissions were from 
three different systems, all of which used potentials based on the neural 
network distance predictions. The systems that are not described here 
are described ina separate paper®. Before T0975, two systems based on 
simulated annealing and fragment assembly (and using 40-bin distance 
distributions) were used. From T0975 onward, newly trained 64-bin 
distogram predictions were used and structures were generated by the 
gradient descent system described here (three independent runs) as 
well as one of the fragment assembly systems (five independent runs). 
The five submissions were chosen from these eight structures (the 
lowest potential structure generated by each independent run) with 
the first submission (top-one) being the lowest-potential structure 
generated by gradient descent. The remaining four submissions were 
the four best other structures, with the fifth being a gradient descent 
structure if none had been chosen for position 2, 3 or 4. All submis- 
sions for T0999 were generated by gradient descent. Extended Data 
Figure 5a shows the methods used for each submission, comparing with 
‘back-fill’ structures generated by a single run of gradient descent for 
targets before T0975. Extended Data Figure 5b shows that the gradient 
descent method that was used later in CASP performed better than 
the fragment assembly method, in each category. Extended Data Fig- 
ure 5c compares the accuracy of the AlphaFold submissions for FM and 
FM/TBM domains with the next best group 322. The assessors of CASP13 
FMused expert visual inspection** to choose the best submissions for 
each target and found that AlphaFold had nearly twice as many best 
models as the next best group. 


Biological relevance of AlphaFold predictions. There is a wide range 
of uses of predicted structures, all with different accuracy require- 
ments, from generally understanding the fold shape to understand- 
ing detailed side-chain configurations in binding regions. Contact 
predictions alone can guide biological insights”, for instance, to 
target mutations to destabilize the protein. Figure 1c and Extended 
Data Fig. 2a show that the accuracy of the contact predictions from 
AlphaFold exceeds that of the state-of-the-art predictions. In Extended 
Data Figs. 6-8, we present further results that show that the accuracy 
improvements of AlphaFold lead to more accurate interpretations of 
function (Extended Data Fig. 6); better interface prediction for pro- 
tein—-protein interactions (Extended Data Fig. 7); better binding pocket 
prediction (Extended Data Fig. 8) and improved molecular replacement 
in crystallography. 


Thus far only template-based predictions have been able to deliver 
the most accurate predictions. Although AlphaFold is able to match 
TBM without using templates, and in some cases outperform other 
methods (for example, T0981-D5, 72.8 GDT_TS, and T0957s1-D2, 88.0 
GDT_TS, two TBM-hard domains for which the top-one model of Alpha- 
Foldis 12 GDT_TS better than any other top-one submission), the accu- 
racy for FM targets still lags behind that for TBM targets and can still 
not be relied on for the detailed understanding of hard structures. In an 
analysis of the performance of CASP13 TBM predictions for molecular 
replacement, another study*’ reported that the AlphaFold predic- 
tions (raw coordinates, without B-factors) led to a marginally greater 
log-likelihood gain than those of any other group, indicating that 
these improved structures can assist in phasing for X-ray crystallog- 
raphy. 


Interpretation of distogram neural network. We have shown that 
the deep distance prediction neural network achieves high accuracy, 
but we would like to understand how the network arrives at its dis- 
tance predictions and—in particular—to understand how the inputs 
to the model affect the final prediction. This might improve our un- 
derstanding of the folding mechanisms or suggest improvements to 
the model. However, deep neural networks are complex nonlinear 
functions of their inputs, and so this attribution problem is difficult, 
under-specified and an on-going topic of research. Even so, there 
are anumber of methods for such analysis: here we apply Integrated 
Gradients” to our trained distogram network to indicate the location 
of input features that affect the network’s predictions ofa particular 
distance. 

In Extended Data Fig. 9, plots of summed absolute Integrated Gradi- 
ent, > 1S; |, (defined in Supplementary equations (7)-(9)) are shown 
for selected /,/ output pairs in TO986s2; and in Extended Data Fig. 10, the 
top-10 highest attribution input pairs for each output pair are shown 
ontop of thetop-one predicted structure of AlphaFold. The attribution 
mapsare sparse and highly structured, closely reflecting the predicted 
geometry of the protein. For the four in-contact pairs presented (1, 2, 
3,5), all of the highest attribution pairs are pairs within or between the 
secondary structure that one or both of the output pair(s) are members 
of.In1, the helix residues are important as well as connections between 
the strands that follow either end of the helix, which might indicate 
strain onthe helix. In 2, all of the most important residue pairs connect 
the same two strands, whereas in 3, a mixture of inter-strand pairs and 
strand residues is most salient. In 5, the most important pairs involve 
the packing of nearby secondary structure elements to the strand and 
helix. For the non-contacting pair, 4, the most important input pairs 
are the residues that are geometrically between /and/Jin the predicted 
protein structure. Furthermore, most of the high-attribution input 
pairs are themselves in contact. 

Asthe network is tasked with predicting the spatial geometry, with no 
structure available at the input, these patterns of interaction indicate 
that the network is using intermediate predictions to discover impor- 
tant interactions and channelling information from related residues 
to refine the final prediction. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Our training, validation and test data splits (CATH domain codes) are 
available from https://github.com/deepmind/deepmind-research/tree/ 
master/alphafold_casp13. The following versions of public datasets 
were used in this study: PDB 2018-03-15; CATH 2018-03-16; Uniclust30 
2017-10; and PSI-BLAST nr dataset (as of 15 December 2017). 
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Code availability 


Source code for the distogram, reference distogram and torsion 
prediction neural networks, together with the neural network weights 
and input data for the CASP13 targets are available for research and 
non-commercial use at https://github.com/deepmind/deepmind- 
research/tree/master/alphafold_casp13. We make use of several 
open-source libraries to conduct our experiments, particularly 
HHblits**, PSI-BLAST” and the machine-learning framework Tensor- 
Flow (https://github.com/tensorflow/tensorflow) along with the Ten- 
sorFlow library Sonnet (https://github.com/deepmind/sonnet), which 
provides implementations of individual model components”. We also 
used Rosetta’ under license. 
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Extended Data Fig. 1| Schematics of the folding system and neural network. 
a, The overall folding system. Feature extraction stages (constructing the MSA 
using sequence database search and computing MSA-based features) are 
shown in yellow; the structure-prediction neural network in green; potential 
construction in red; and structure realization in blue. b, The layers used in one 
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block of the deep residual convolutional network. The dilated convolution is 
applied to activations of reduced dimension. The output of the block is added 
tothe representation from the previous layer. The bypass connections of the 
residual network enable gradients to pass back through the network 
undiminished, permitting the training of very deep networks. 
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Extended Data Fig. 2 | CASP13 contact precisions. a, Precisions (as shownin 
Fig. 1c) for long-range contact prediction in CASP13 for the most probable ZL, L/2 
orL/5 contacts, where L is the length of the domain. The distance distributions 
used by AlphaFold (AF) in CASP13, thresholded to contact predictions, are 
compared with submissions by the two best-ranked contact prediction 
methods in CASP13: 498 (RaptorX-Contact?*) and 032 (TripletRes”), on ‘all 


groups’ targets, with updated domain definitions for T0953s2. b,c, True 
distances (b) and modes of the predicted distogram (c) for CASP13 target 
T0990. CASP divides this chain into three domains as shown (D3 is inserted in 
D2) for which there are 39, 36 and 42 HHbiits alignments, respectively (from the 
CASP website). 
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Extended Data Fig. 3| Analysis of structure accuracies. a, |IDDT,, versus 
distogram IDDT,, (see Methods, ‘Accuracy’). The distogram accuracy predicts 
the IDDT of the realized structure well (particularly for medium- and long-range 
residue pairs, as well as the TM score as shown in Fig. 4a) for both CASP13 
(n=500:5 decoys for domains excluding T0999) and test (n= 377) datasets. 
Data are shown with Pearson’s correlation coefficients. b, DLDDT,, against the 
effective number of sequences in the MSA (N+) normalized by sequence length 
(n=377). The number of effective sequences correlates with this measure of 
distogram accuracy (r=0.634).c, Structure accuracy measures, computed on 
the test set (n = 377), for gradient descent optimization of different forms of the 
potential. Top, removing terms in the potential, and showing the effect of 
following optimization with Rosetta relax. ‘P’ shows the significance of the 


potential giving different results from ‘Full’, for a two-tailed paired data t-test. 
‘Bins’ shows the number of bins fitted by the spline before extrapolation and 
the number in the full distribution. In CASP13, splines were fitted to the first 51 
of 64 bins. Bottom, reducing the resolution of the distogram distributions. The 
original 64-bin distogram predictions are repeatedly downsampled by a factor 
of 2 by summing adjacent bins, in each case with constant extrapolation 
beyond 18 A (the last quarter of the bins). The two-level potential in the final 
row, which was designed to compare with contact predictions, is constructed 
by summing the probability mass below8 A and between 8 and 14 A, with 
constant extrapolation beyond 14 A. The TMscores in this table are plotted in 
Fig. 4b. 
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Extended Data Fig. 4 | TMscore versus per-target computation time 
computed as an average over the test set. Structure realization requires a 
modest computation budget, which can be parallelized over multiple 
machines. Full optimization with noisy restarts (orange) is compared with 
initialization from sampled torsions (blue). Computation is measured as the 
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product of the number of (CPU-based) machines and time elapsed and canbe 
largely parallelized. Longer targets take longer to optimize. Figure 2e shows 
how the TM score increases with the number of repeats of gradient descent. 
n=377., 
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Extended Data Fig. 5 | AlphaFold CASP13 results. a, The TMscore for each of 
the five AlphaFold CASP13 submissions are shown. Simulated annealing with 
fragment assembly entries are shown in blue. Gradient-descent entries are 
shown in yellow. Gradient descent was only used for targets T0975 and later, so 
tothe left of the black line we also show the results for a single ‘back-fill’ run of 
gradient descent for each earlier target using the deployed system. T0999 
(1,589 residues) was manually segmented based on HHpred*™ homology 
matching. b, Average TM scores of the AlphaFold CASP13 submissions 

(n=104 domains), comparing the first model submitted, the best-of-five model 
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(submission with highest GDT_TS), asingle run of full-chain gradient descent 

(a CASP13 run for T0975 and later, back-fill for earlier targets) and asingle 
CASP13 run of fragment assembly with domain segmentation (using a gradient 
descent submission for T0999). c, The formula-standardized (z) scores of the 
assessors for GDT TS + QCS”, best-of-five for CASP FM (n=31) and FM/TBM 
(n=12) domains comparing AlphaFold with the closest competitor (group 322), 
coloured by domain category. AlphaFold performs better (P= 0.0032, one- 
tailed paired statistic t-test). 
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Extended Data Fig. 6 | Correct fold identification by structural searchin 
CATH. Often protein function can be inferred by finding homologous proteins 
of known function. Here we show that the FM predictions of AlphaFold give 
greater accuracy ina structure-based search for homologous domains in the 
CATH database. For each of the FM or TBM/FM domains, the top-one 
submission and ground truth are compared to all 30,744 CATH S40 non- 
redundant domains with TM-align®. For the 36 domains for which there is a 


good ground-truth match (score > 0.5), we show the percentage of decoys for 
whicha domain with the same CATH code (CATH in red, CA in green; CAT results 
are close to CATH results) as the top ground-truth match is in the top-k matches 
with score > 0.5. Curves are shown for AlphaFold and the next-best group (322). 
AlphaFold predictions determine the matching fold more accurately. 
Determination of the matching CATH domain can provide insights into the 
function of anew protein. 
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Extended Data Fig. 7 | Accuracy of predictions for interfaces. Protein— 
protein interaction is an important domain for understanding protein function 
that has hitherto largely been limited to template-based models because of the 
need for high-accuracy predictions, although there has been moderate 
success* in docking with predicted structures up to 6 Ar.m.s.d. This figure 
shows that the predictions by AlphaFold improve accuracy in the interface 
regions of chains in hetero-dimer structures and are probably better 
candidates for docking, although docking did not form part of the AlphaFold 
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system and all submissions were for isolated chains rather than complexes. For 
the five all-groups heterodimer CASP13 targets, the full-atom r.m.s.d. values of 
the interface residues (residues with a ground-truth inter-chain heavy-atom 
distance <10 A) are computed for the chain submissions of all groups (green), 
relative to the target complex. Results >8 A are not shown. AlphaFold (blue) 
achieves consistently high accuracy interface regions and, for 4 out of S 
targets, predicts interfaces below <5 A for bothchains. 
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Extended Data Fig. 8 | Ligand pocket visualizations for T1011. T1011 (PDB true pocket than that of the best other submission (322, model 3, 68.7 GDT TS) 
6M97T) is the EP3 receptor bound to misoprostol-FA®. a, The native structure (c). Both submissions are aligned to the native protein using the same subset of 
showing the ligand ina pocket. b, c, Submission 5 (78.0 GDT TS) by AlphaFold residues from the helices close to the ligand pocket and visualized with the 


(b), made without knowledge of the ligand, shows a pocket more similar to the interior pocket together with the native ligand position. 
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Extended Data Fig. 9 | Attribution map of distogram network. The contact 
probability map of T0986s2, and the summed absolute value of the Integrated 
Gradient, dS ijcl, of the input two-dimensional features with respect tothe 
expected distance between five different pairs of residues (/,/): (1) a helix self- 


contact, (2) along-range strand-strand contact, (3) amedium-range strand- 
strand contact, (4) anon-contact and (5) a very long-range strand-strand 
contact. Each pair is shown as two red dots on the diagrams. Darker colours 
indicate a higher attribution weight. 
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Extended Data Fig. 10 | Attribution shown on predicted structure. For lighter green colours indicate more sensitive, and the output pair is shownasa 
T0986s2(TMscore 0.8), the top 10 input pairs, including self-pairs, with the blueline. 

highest attribution weight for each of the five output pairs shown in Extended 

Data Fig. 9, are shownas lines (or spheres for self-pairs) coloured by sensitivity, 


natureresearch ee eisai snes 


Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 
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When statistical analyses are reported, confirm that the following items are present in the relevant location (e.g. figure legend, table legend, main 
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The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


An indication of whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 
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Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistics including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND 
variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
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For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


O Clearly defined error bars 
State explicitly what error bars represent (e.g. SD, SE, Cl) 


Our web collection on statistics for biologists may be useful. 


Software and code 


Policy information about availability of computer code 


Data collection To ingest and pre-process the data, we used the following tools: PDB 2018-03-15 ; CATH 2018-03-16; HHblits based on version 3.0- 
beta.3; HHpred web server; Uniclust30 2017-10; PSI-BLAST version 2.6.0; SST web server (March 2019); BioPython v1.65; Rosetta v3.5; 
TM-align 20160521, as well as custom code written using Python 2.7. See the methods section for more details. 


Data analysis The networks used the TensorFlow library with custom extensions. Inference code for distance prediction networks used in CASP13 will 
be open-sourced. Analysis was performed with custom code written in Python 2.7. Visualizations were made with PyMol 2.2.0 software. 
Please see methods section for more detail. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers 
upon request. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


The following public datasets were used in this work: 


- PDB 2018-03-15 
- CATH 2018-03-16 
- Uniclust30 2017-10 


- PSI-BLAST nr dataset (as of 2017-12-15) 
We will make available our train/test split (CATH domain codes). 


Field-specific reporting 


Please select the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


[X] Life sciences 


[_] Behavioural & social sciences [| Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/authors/policies/ReportingSummary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size 


Data exclusions 


Replication 


Randomization 


Blinding 


Tests were carried out on two sets of domains generated by predetermined methods. 

The principal set consisted of 104 CASP13 domains as segmented by the CASP13 assessors. (In some figures 100 domains are used, excluding 
the domains of T0999 where slightly different methods were used; or 41 considering FM + TBM/FM targets) 

For the CASP13 datasets, accuracy measures are reported using structures & distance distributions computed during CASP (except for the 
"back-fill" gradient descent structures where specified). With n=41 domains, we show (Extended Data Figure 5) that (with p=0.0032) 
AlphaFold's results were better than the next best group. 

The other test set of 377 domains was extracted from PDB, chosen to be from separate homologous superfamilies (CATH code) none of which 
were represented in the training set. For each superfamily the exemplar was chosen at random from the s35 cluster representatives. 
Extended Data Table 3c shows that this sample size is sufficient to show that certain terms in the potential (but not score2) make a difference 
in the final GDT_TS score. 


The training set used one example per s35 cluster, using the representative chosen by CATH. 

We further balance (and reduce the size of) the test set by keeping one example per superfamily. 

CASP13 domains are only those "all groups" targets which were scored as part of CASP13. 

Some CASP13 targets were excluded by the assessors e.g. because of publications or failure to solve a structure. 


Actual competition entries were used and were not replicated, but we show comparisons for top-5 decoys. 

Multiple networks were trained independently and were found to give consistent results (on CASP11/12 validation sets). 4 such networks, 
yielding similar results, were used in CASP and are made available with the source code. 

Structure generation is stochastic, but we show (Extended Data Fig 4) that it converges with relatively few attempts, so results are 
reproducible. 


Superfamilies were randomly assigned to PDB train or PDB test set. 

For each superfamily in the test set, a random s35 cluster representative was picked for the 377 example test set. 

CASP13 assessors used a manual process (blind to the investigators) to determine evaluation units and to determine whether each domain 
should be treated as FM, TBM or TBM/FM. 


Investigators were blind to the CASP13 targets which were sequestered during the assessment. 
Investigators were not blind to the PDB train/test split, but cross-validation used CASP11 & 12 domains and the main results are on the blind 
CASP13. Models were not retrained after defining the 377 domain set. 
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SAGA (Spt-Ada-Gcn5~acetyltransferase) is a19-subunit complex that stimulates 
transcription via two chromatin-modifying enzymatic modules and by delivering the 
TATA box binding protein (TBP) to nucleate the pre-initiation complex on DNA, a 
pivotal event in the expression of protein-encoding genes’. Here we present the 
structure of yeast SAGA with bound TBP. The core of the complex is resolved at 3.5A 
resolution (0.143 Fourier shell correlation). The structure reveals the intricate 
network of interactions that coordinate the different functional domains of SAGA and 
resolves an octamer of histone-fold domains at the core of SAGA. This deformed 
octamer deviates considerably from the symmetrical analogue in the nucleosome and 
is precisely tuned to establish a peripheral site for TBP, where steric hindrance 
represses binding of spurious DNA. Complementary biochemical analysis points toa 
mechanism for TBP delivery and release from SAGA that requires transcription factor 
IIA and whose efficiency correlates with the affinity of DNA to TBP. We provide the 
foundations for understanding the specific delivery of TBP to gene promoters and the 


multiple roles of SAGA in regulating gene expression. 


Transcription of protein-encoding genes begins with the formation ofa 
pre-initiation complex (PIC) comprising RNA polymerase Il and several 
general transcription factors”. PIC assembly is nucleated by loading TBP 
onto promoter DNA’, a focal point for regulated gene expression‘. Two 
multiprotein complexes, transcription factor IID (TFIID) and SAGA, 
can deliver TBP to gene promoters and are required for global gene 
expression in yeast’®. 

The 1.6-MDa SAGA complex also stimulates transcription via its two 
chromatin-modifying enzymatic activities. The complex comprises 
19 subunits organized in four modules with distinct functions”: a his- 
tone acetyltransferase (HAT) module, a histone deubiquitinase (DUB) 
module, the 430-kDa Tral subunit that serves as a docking platform for 
transcription factors that recruit SAGA to activating DNA sequences 
upstream of the promoter, and a 10-subunit central module that is 
physically connected to all other modules andis responsible for recruit- 
ing TBP to SAGA (Extended Data Fig. 1a). 

Structures of several individual isolated components of SAGA, nota- 
bly the DUB module", the catalytic subunit of the HAT module (Gcn5)” 
and Tral®"* have been obtained. Previous electron microscopy studies 
positioned Tral in one lobe of the SAGA complex, but all other subunits 
appeared together as a main lobe that could only be described at low 
resolution®”’. 

The central module of yeast SAGA shares five TBP-associated factors 
(TAFs) with TFIID. Both complexes also contain multiple histone-fold 
domains that associate into heterodimers and higher-order oligomers” 
(Extended Data Fig. 1a, b). The central module is therefore expected to 
be structurally related to TFIID, suggesting that TBP binding in both 
complexes shares some common features. Progress in defining the 


architecture of TFIID was reported recently; however, the machinery 
responsible for TBP binding and delivery remained poorly resolved”. 

Here we report the 3.5 A-resolution structure of the full SAGA com- 
plex bound to TBP, determined from cryo-electron microscopy (cryo- 
EM) maps that show clear density for the majority of the protein side 
chains. Our structure describes in atomic detail the network of inter- 
actions that coordinate the functional domains of SAGA as well as a 
machinery precisely geared to binding TBP and preventing aberrant 
transcription initiation. With complimentary experimental findings, 
our structure points to a mechanism of TBP delivery and release from 
SAGA that depends on TFIIA and the affinity of promoter DNA to TBP, 
providing new insights into regulation of PIC assembly. 


Overall architecture of SAGA 


We reconstituted the complex with SAGA purified from the yeast 
Komagataella phaffii (also known as Pichia pastoris), full length TBP 
and ubiquitin aldehyde and collected a dataset of more than one mil- 
lion particles. Image analysis resulted in a map of the full SAGA com- 
plex at 3.8 A resolution, calculated from 338,850 selected particles 
(Fig. la, Extended Data Figs. 1c, d,2, Supplementary Video 1, Extended 
Data Table 1). Focused refinement of the main lobe and of Tral further 
improved their average resolution to 3.5 A, with many regions reaching 
as highas3 A. Hence, Tral and the central module, as well as the domains 
of Sgf73 that anchor the DUB to the central module, show clear densi- 
ties for the majority of side chains (Fig. 1b, Extended Data Fig. 2b, c). 
At the periphery of SAGA, exposed next to Spt3, we identified the 
distinctive saddle-shaped density of the conserved core of TBP (cTBP), 
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PA IW) © , 
Histone fold octamer 
Fig. 1| Structure of the yeast SAGA-TBP complex. a, Front (top) and side 
(bottom) views of acomposite cryo-EM reconstruction of the yeast SAGA-T BP 
complex. Maps from focused refinements of Tral (tan), DUB (light pink), Spt8 
(white), the main lobe and TBP (blue-green) were combined. HAT density 
(light green) is taken from the low-pass-filtered map of the full complex. 
b, Corresponding views of the atomic model of the SAGA-TBP complex 
focusing onthe main lobe.c, The Taf5 WD40 domain structurally orchestrates 
the main lobe. Orientation as ina (top). d, Intricate network of protein contacts 
with the TafS WD40 domain (white). 


which was absent ina separate map of SAGA lacking TBP (Extended Data 
Fig. 3a—d). Local classification revealed sufficient secondary struc- 
tural features to guide the precise positioning of cTBP. Site-specific 
cross-linking and documented mutations resolved any ambiguity in 
orientation arising from the pseudosymmetric fold of cCTBP”’. A disc- 
shaped density neighbouring TBP was assigned to the WD40 protein 
Spt8, which was shown to interact with TBP”? (Extended Data Fig. 3e, 
f). Atthe opposite pole of the main lobe, the DUB crystal structure fits 
into and occupies a large but not well-resolved density that is nearly 
completely detached from and independent of SAGA. The DUB mod- 
ule is connected to the main lobe by a narrow, elongated density that 
contacts the surface of SAGA proximal to the Spt20 SEP domain and the 
LisH domain of Taf5 (Fig. 1a, b, Extended Data Fig. 1). The HAT enzymatic 
module adopts multiple conformations and orientations. Accordingly, 
the bulk of this module is visible only as an elongated featureless density 
at the tip of our map. Nevertheless, we could identify its docking site 
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on SAGA, where two helical domains—attributable to Ada3-—lie at the 
surface of the Taf6 HEAT-repeats domain. 


Taf5 orchestrates the main lobe 


The main lobe is dominated by the seven-blade WD40 propeller of Taf5 
(Fig. 1c). This domain occupies a central position, serving asa structural 
hub that connects all major SAGA functions. The flat top face of the 
propeller disc clamps together the TafS N-terminal domain (NTD) and 
the Taf6 HEAT repeats that bind the HAT module. The opposite bottom 
face of WD40 holds the histone-fold machinery that ultimately binds 
TBP and connects to the DUB module and Tral. 

The positions of the Taf5 NTD and Taf6 HEAT repeats are markedly 
different in SAGA compared with TFIID”, as these domains assume dif- 
ferent roles in establishing connections to the enzymatic DUB (the LisH 
extension of Taf5 NTD) and HAT (Taf6) modules (Fig. 1c, Extended Data 
Fig. 4). Compared with TFIID, the TafS5 NTD in SAGA makes a rotation 
of almost 180 degrees around the WD40 domain, and the Taf6 HEAT 
repeats, which are remote in TFIID, are relocated to the vicinity of the 
top face of WD40 in SAGA. Notably, unlike SAGA, TFIID harbours two 
copies of Taf5 and Taf6 that are held together by homodimerization of 
Taf5 NTDs and Taf6 HEAT repeats. As a result of all these variations and 
additional interactions with SAGA-specific subunits, the TafS5 WD40 
environment is very different in SAGA compared with TFIID. 

The ability of Taf6 HEAT repeats to occupy three different positions in 
SAGA and TFIID relative to the histone fold is afforded by a136-residue 
linker between these two domains. The Taf6 linker partially encircles 
the TafS5 WD40 domain (Fig. 1c), and its position is stabilized by several 
interactions along its trajectory. Most notably, the Taf6 linker residues 
T135-L141adda B-strand to the last blade of the WD40 propeller, form- 
ing an inter-protein B-sheet. 

WD40 domains are recognized as potential hubs that mediate pro- 
tein-protein interactions. The TafS WD40 in SAGA is a noteworthy 
demonstration of this capacity, as it associates with at least 11 different 
protein domains (Fig. 1c, d). Taf5 is thus a centrally located master hub 
that orchestrates the architecture of the entire main lobe. 


A deformed histone-fold octamer 


An octamer consisting of four histone-fold pairs arranged in a disc- 
shaped spiral, similar to the nucleosomal histone octamer, has been 
proposed to reside in TFIID and SAGA». Our structure presents an 
atomic model of the full histone-fold octamer that resides at the core of 
SAGA and, most probably with minor variations, inthe TBP-binding lobe 
of TFIID (Fig. 2a). It reveals a disparity between histone-fold interactions 
in SAGA compared with the nucleosome, particularly regarding the 
incorporation into SAGA of the TBP-binding subunit Spt3, ahomologue 
of the TFIID Taf11-Taf13 histone-fold pair”* (Supplementary Video 2). 

In the nucleosome, the interaction between neighbouring histone 
pairs is mediated by a tightly packed four-helix bundle composed of 
helices a2 and «3 of one histone from each pair?°. The SAGA histone- 
fold pairs are organized along the spiral in the following order: Taf6- 
Taf9, Taf12—Adal, Spt7-Taf10 and Spt3, which contains two histone 
folds, one at each of its ends: nSpt3-HF and cSpt3-HF. Whereas all other 
histone-fold pairs are oriented as in the nucleosome, the Spt3 pair is 
tilted by 20 degrees compared with its analogous nucleosome H2A- 
H2B histone pair (Fig. 2b). This tilt prevents participation of the TaflO 
o3-helix inthe interaction, shears the four-helix bundle and therefore 
nearly completely frees Spt3 from its association with the histone-fold 
octamer. The remaining contact is facilitated by a slight tilt in Tafl0 and 
maintained by only very few bonds between Spt3 and Taf10 residues 
(Fig. 2c). To secure incorporation of Spt3 into SAGA, the C-terminal tail 
of Spt3 inserts into the centre of the spiral disc (a void volume in the 
nucleosome), makes a near complete circle and forms single bonds 
with each of the other histone folds in SAGA (Fig. 2d). 


Fig. 2| The octamer of histone folds. a, Structural organization of the four 
pairs of histone folds forming an octamer analogous to the nucleosomal 
histones. b, Left, schematic representation of the histone-fold octamer inthe 
nucleosome (top) and in SAGA (bottom). Right, the four-helix bundle 
connecting the Spt7-Taf10 histone-fold pair (pink) to the partially dislodged 


The Spt3 deviation from the canonical octamer does not originate 
from a single structural determinant. The atypical sequences of the 
histone folds (in particular Tafl0 and Spt3), the long Taf10 loop between 
its a2- and a3-helices and the inherent asymmetry arising from the 
eight different histone-fold proteins involved all contribute. In this 
respect, it is noteworthy that the deviation from the nucleosome pat- 
ternis already apparent at the preceding four-helix bundle where the 
association almost entirely lacks hydrophobic packing (Extended Data 
Fig. 5). We propose that this assembly of histone folds, which deviates 
considerably from the symmetric arrangement in the nucleosome, is 
geared specifically towards a delicate balance between a rigid Spt3 that 
is firmly embedded into the octamer and an overly flexible Spt3 that 
associates with SAGA through loops alone. Below we examine these 
ideas in the context of TBP binding and release from SAGA. 


TBP binding and release machinery 


SAGA presents TBP at the extremity of the main lobe embracing it by 
binding to two opposite ends of its surface and orienting the DNA- 
binding cleft of TBP towards the main body of SAGA (Fig. 3a). The 
principal component in coupling TBP to SAGA and determining its 
position is Spt3. The C-terminal stirrup of TBP is the primary binding 
site for Spt3, similar to the interaction between TBP and TFIIB”. InSAGA, 
the stirrup is nearly completely buried and forms multiple contacts 
in a large pocket created by helices aN and a2 from cSpt3-HF and by 
helix aC, which is joined to nSpt3-HF via a conserved two-amino-acid 
rigid turn (Figs. 2c, 3b). The association of the C-terminal half of the 
pseudosymmetric cTBP with Spt3 is in line with previous biochemical 
and genetic data” (Extended Data Fig. 6). Indeed, accommodation of 
the cTBP N-terminal half by Spt3 is unfavourable, as this would involve 
the negatively charged D81 facing a negatively charged patch on Spt3 
and place the bulky charged R98 where the C-terminal stirrup L189 is 
tightly packed against a hydrophobic crevice (Fig. 3b). 

Incontrast to Spt3, we find that Spt8 is not part of the central module 
(Fig. 3a). It is flexibly tethered via a fuzzy density to the outward tips 
of two long helices, extensions of each end of the Spt7 histone fold. 
The fuzzy density corresponds, at least in part, to a 30-residue-long 


nSpt3-cSpt3 histone-fold pair (brown) compared with the analogous 
nucleosome interaction (grey). c, Detailed view of the interaction between the 
Spt7-Taf10 histone-fold pair and the nSpt3-cSpt3 histone-fold pair.d, The 
C-terminal tail of Spt3 inserts into the centre of the histone-fold octamer. 


unstructured region in the C terminus of Spt7, the deletion of which 
results in loss of Spt8”°. The connection of Spt8 to the N-terminal half 
of cTBP is flexible and appears to have a secondary role in positioning 
TBP. This finding is consistent with deletion mutants of Spt3 in yeast 
showing a sharp decline in global levels of newly synthesized mRNA, 
whereas impairment of Spt8 had almost no effect’. We suggest that Spt8 
has an auxiliary role in tuning the orientation of TBP and in competing 
with other TBP-associated proteins” ™. The flexible tethering of Spt8 to 
the SAGA core contributes to small movements of TBP that are reflected 
inalower local resolution. This ‘breathing’ of TBP might be important 
in transiently enabling initial access to DNA or regulatory factors. 
The space between TBP and the main lobe of SAGA is just wide enough 
to accommodate a double-stranded DNA bent by TBP (Fig. 4a). How- 
ever, the path of DNA is obstructed by structural elements located 
distally to the TBP DNA-binding cleft, notably the peripheral ring of 
the Tral HEAT repeats, as well as the nSpt3-HF helix aC, a major com- 
ponent of the pocket that envelops the C-terminal TBP stirrup. Gel-shift 
assays (Fig. 4b, Extended Data Fig. 7b) and pull-down assays (Fig. 4c, d, 


TBP C-stirrup 


Fig.3| Architecture of the TBP-docking site. a, cTBP interacts with Spt3 
mainly through the stirrup of its C-terminal half (labelled C), whereas its 
N-terminal half (N) binds to the Spt8 WD40 domain. A poorly resolved Spt7 
domain (pink ellipse) connects Spt8 to the central module. b, Enlargement of 
the boxed area ina, depicting the interaction between the C-terminal stirrup of 
TBP anda pocket created by Spt3 helices. 
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Fig. 4 | Steric hindrance for TBP binding to DNA and TFIIA-dependent TBP 
delivery. a, DNA (light blue) was modelled into the DNA-binding cleft of TBP by 
superposing the structure of cCTBP-DNA (PDB: 1YTF) on SAGA-bound cTBP. 
DNAclashes with Tral HEAT repeats and with Spt3 helix aC. b, Gel-shift assay 
using a Cy5-labelled TATA-containing DNA (TATA), a TATA-like DNA anda non- 
specific DNA (TATA-less). DNA probes and TFIIA (as indicated) were addedtoa 
preformed SAGA-TBP complex. The asterisk indicates minor non-specific DNA 


Extended Data Fig. 7a) of DNA binding to SAGA-TBP demonstrate that 
this mild steric hindrance is nonetheless sufficient to impede binding 
of even high-affinity DNA containing a consensus TATA box (Fig. 4b, 
lane 10, Fig. 4c, d). 

The general transcription factor TFIIA enhances DNA binding to TBP, 
competes with Spt8 for occupation of the N-terminal half of cCTBP and 
is found in PICs nucleated by SAGA” **, When a consensus TATA-box- 
containing DNA was incubated with the SAGA-TBP complex together 
with TFIIA, TBP was released from SAGA as a ternary DNA-TBP-TFIIA 
complex (Fig. 4b, lane 3, Extended Data Fig. 7d). The binding of DNAto 
TBP is coupled with the release of TBP, as we did not observe any DNA 
associated with SAGA-bound TBP. Indeed, the release of TBP from SAGA 
can now be understood as being essential for the subsequent steps in 
assembling the PIC because the clasping of the TBP C-terminal stirrup 
by Spt3 and TFIIB are mutually exclusive. TBP was also detached from 
SAGA, although less efficiently, when DNA bearing the TATA element 
containing two mismatches was used (Fig. 4b, lanes 5 and 6), whereas 
no release of TBP was observed when the DNA did not contain any TATA 
or TATA-like elements (Fig. 4b, lanes 7, 8). These experiments show that 
the detachment of TBP from SAGA correlates with the affinity of TBP 
for its DNA substrate. 

Our structure suggests that the TFIIA-stimulated mechanism for 
the concerted DNA binding and TBP release from SAGA includes the 
following steps: TFIIA first displaces Spt8 from TBP; it then promotes 
an initial binding of DNA to TBP, thus shifting helix aC of Spt3, which 
obstructs the path of DNA. Notably, as helix aC is firmly joined to helix 
«3 innSpt3-HF, this crucial step is made possible because the deformed 
octamer does not immobilize this histone fold via a tight four-helix 
bundle. Shifting helix aC breaks the pocket that clasps the C-terminal 
stirrup; TBP can then tilt, allowing DNA to pass in the gap between 
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association with SAGA. c, d, Pull-down assays with beads that capture SAGA. 
Components were labelled with different fluorophores and incubated as for 
gel-shift experiments. The proportion of fluorescent signal that is due to the 
unbound (c) and bound (d) complexes is presented. The rightmost bars 
represent experiments in which TBP was first incubated with DNA and TFIIA 
before the addition of SAGA. Experiments were repeated three to four times. 
Data are shownas dots for individual experiments and mean ¢+s.d. 


Tral and the main body of SAGA, completing the binding to DNA. Final 
release of TBP from SAGA might still require a further conformational 
change in Spt3. Spt3 alone is unable to bind TBP”, and it is tempting to 
speculate that by shifting helices aC-a3 of nSpt3-HF, partially loosening 
Spt3 ties with SAGA, a conformation similar to isolated Spt3 ensues. 
Following the release of TBP and DNA, helix aC can regain its position, 
enabling Spt3 to restore its TBP-binding conformation. 

Ithas been suggested that in TFIID, the association of TBP with spu- 
rious DNA is prevented by a helical protein domain that mimics TATA 
DNA and binds at the DNA cleft of TBP®. Remarkably, our maps do not 
show any substantial density attached at the DNA-binding cleft of TBP. 
Instead, our structure reveals a mechanism for preventing association 
of non-specific DNA based on limited steric hindrance, distal to the DNA 
cleft in TBP. Only the synergy of a cognate TATA or TATA-like DNA with 
TFIIA can overcome this hindrance. 


Firmly tethered flexible enzymatic modules 


The DUB and HAT modules are highly dynamic (Fig. 1a, b, Extended 
Data Fig. 8a, b), permitting the exploration of a large conformational 
space in search of their chromatin substrates. The structure shows 
how these modules maintain a robust binding to SAGA despite this 
flexibility. The DUB module consists of three subunits and the Sgf73 
N-terminal end. The C-terminal part of Sgf73 was implicated in attach- 
ing DUB to SAGA, and deletion analysis narrowed down the anchoring 
region to residues 350-400 in Saccharomyces cerevisiae (227-277 in K. 
phaffii), as removal of this region leads to dissociation of an intact DUB 
module from SAGA***”. We found that these residues form an elongated 
domain comprising two helices, Sgf73-H1 and Sgf73-H2, connected by 
a stretched loop and embedded in the central module (Fig. 5a). This 


Spt? Taf12 


a 


Fig. 5| Connecting the DUB module and Tral to the central module. a, The 
Sgf73 domain that contains helices Sgf73-H1 and Sgf73-H2 (cyan) anchors the 
DUB module within the central module. Sgf73-H2 is embedded ina chamber 
formed by Spt20 (pink) and Adal (dark grey). The insert highlights the depicted 
region within the overall structure of SAGA. b, c, Bridges between the central 
module and Tral (tan) established by Spt20 (pink) (b) and Taf12 (green) (c). 


DUB-anchoring domain traverses the core, entering proximal to the 
Taf9 histone fold and exiting next to the Spt20 SEP domain. Sgf73-H2 
lies ina chamber-like structure composed of several helices from the 
Spt20 SEP domain and from an elongated domain of Adal. The chamber 
consists exclusively of SAGA-specific elements with no contribution 
from subunits shared with TFIID. Thus, a dedicated environment is 
created at the core of SAGA, partially occupied in TFIID by the Taf5 
NTD, to house the helix that anchors the DUB module into SAGA. The 
DUB module is tethered to the SAGA core through a164-residue poorly 
resolved linker that connects the anchoring domain to the N-terminal 
end of Sgf73 (Extended Data Fig. 8c, d). 

In contrast to the DUB, no parts of the HAT module are embedded 
in the SAGA core (Fig. 1b). The HAT densities emerge, harbouring two 
Ada3 helical domains, at the surface of the Taf6 HEAT repeats that are 
used for homodimerization in TFIID (Extended Data Fig. 8e, f). 

Thus, the two enzymatic modules of SAGA adopt very different 
strategies to remain bound to SAGA while maintaining their freedom 
of movement. The DUB module is a stably folded structure” that 
gains independence through an unstructured long linker that has little 
contact with the SAGA surface until it connects to an extension firmly 
embedded in the SAGA core. The HAT module, on the other hand, is 
itself a flexible structure”, with several submodules as well as a large 
proportion of intrinsically unfolded sequences, and it binds only tothe 
surface of the SAGA core, using two helical domains of Ada3. 


Network of unstructured protein elements 


Three distinct bridges serve to physically and possibly functionally 
couple Tral to the central module. The first bridge is established by a 
highly dispersed domain of Spt20 that originates as a strand joining 
the Spt20 SEP B-sheet, followed by a helix-turn-helix that also interacts 
extensively with the Spt20 SEP domain (Fig. 5b). It continues as aloop of 
around 70 residues, touching several subunits as it traverses the surface 
of the SAGA core onits 160 A-long way to Tral. There, it forms a helix 
that associates with the surface of the FAT (FRAP, ATM and TRRAP) 
domain and is followed by a second helix, which is clamped within a 
concave structure in Tral. This second helix corresponds to the Spt20 
region that is denoted as the HIT domain and the deletion of which 
prevents assembly of Tral into SAGA*®. An exceptionally long loop 
that precedes the histone fold of Taf12 is responsible for establishing 
the second bridge (Fig. 5c). Residues 430-474 from this loop trace a 
lasso-shaped thread at the surface of the Tral FAT domain within the 
deep groove formed between two tetratricopeptide repeat-containing 
domains (TRDs). The third bridge is formed by the loop connecting the 


second and third helices of cSpt3-HF as it infiltrates between two HEAT 
repeats of the Tral ring (Extended Data Fig. 9a, b). 

We also observe several protein stretches that lack secondary struc- 
ture elements but contact multiple subunits and contribute consider- 
ably to an intricate intertwined network of interactions within the main 
lobe of SAGA. For example, the C-terminal tail in Taf9 and the short 
loop preceding it contact four histone folds and six other domains 
(Extended Data Fig. 9c). This network of interactions is necessary for 
stabilizing an octamer that could not be reconstituted in vitro from its 
constituent histone folds and underlies the effects of some mutations 
on functions that do not directly involve the altered proteins”. 


Conclusion 


The high-resolution description of SAGA unravels how the central 
module acts as a scaffold that coordinates an activator-interaction 
platform and two enzymatic modules, as well as TBP binding and 
release. Our structure reveals that a deformed histone-fold octamer, 
aided by components of Spt7 and Spt8, is finely tuned to define a TBP- 
docking site in which distal steric hindrance represses DNA binding, 
thus preventing unwanted transcription initiation. However, with the 
assistance of TFIIA, TBP can productively bind DNA to forma ternary 
TBP-TFIIA-DNA complex that is released from SAGA. The efficiency 
and probability of TBP detachment is commensurate with the affinity 
of TBP for the DNA sequence. 

These findings imply a short residency time for SAGA onthe minimal 
promoter, in accordance with genome-wide analyses that locate SAGA 
mainly at upstream activating sequences (UASs)”*°. The peripheral 
position of TBP and its binding mode in SAGA also suggest that tran- 
scription regulatory factors such as negative cofactor 2 (NC2) or Motl 
could gain access to TBP with relative ease. These results are consist- 
ent with a highly dynamic scenario for the control of TBP turnover". 

Sequence homology strongly suggests that the histone-fold machin- 
ery in TFIID assumes a similar function, which implies that the mechan- 
ics of depositing TBP on DNA by the two complexes share common 
features. However, several crucial aspects of TBP binding and delivery 
vary between the two complexes, including the capacity of the TFIID 
subunits Taf2 and Taf4 to bind promoter DNA, the ability of the Tafl 
NTD to interact with the DNA-binding interface of TBP and the stronger 
affinity of Tafl1-13, compared with Spt3, for TBP”. Thus, access of tran- 
scription regulatory factors to TBP and residency time at the minimal 
promoter are expected to be significantly different between TFIID 
and SAGA. 
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Methods 


No statistical methods were used to predetermine sample size. 


Purification of SAGA 

The SAGA complex was purified from nuclear extracts of the budding 
yeast K. phaffii (also known as P. pastoris) using the 38-amino-acid 
streptavidin-binding peptide (SBP) affinity tag that was fused to the 
C terminus of the endogenous Sgf73 subunit (Extended Data Figs. 1, 
7e). We introduced several modifications into our original procedure“, 
with the combined effect of increasing the yield by three times. These 
improvements enabled us to reduce the harmful effects of proteases 
and to keep SAGA more concentrated along the different steps of the 
procedure. 

Two litres of yeast cells (four flat-bottom flasks with 0.51 in each) were 
grownat 24 °C with glycerol as carbon source and collected when opti- 
cal density at 600 nm (ODgoonm) reached 16-18. Cells were washed in ice- 
cold water and then treated with 10 mM DTT. The cell wall was digested 
by addition of lyticase and spheroplasts were pelleted at 5,500g for 
20 min. All further steps were performed at 0 to 4 °C. Protease inhibi- 
tors were added to all buffers. Spheroplasts were washed twice in1.1M 
sorbitol and were then disrupted by suspension in a hypotonic buffer 
(15-18% Ficoll 400, 0.6 mM MgCl,, 20 mM K-phosphate buffer pH 6.6) 
using a ULTRA-TURRAX disperser. Sucrose (0.1 M) and MgCl, (5 mM) 
were then added. Nuclei (and some debris) were pelleted at 33,000g for 
37 min, resuspended ina wash buffer (0.6 M sucrose, 8% polyvinylpyr- 
rolidone, 1mM MgCl, 20 mM phosphate buffer pH 6.6) and pelleted 
again at 34,000g for 50 min. Nuclei were resuspended in low ionic 
extraction buffer (40 mM HEPES pH 8.0, 20% sucrose, 8 mM MgCl, 
5-6 mM DTT) with 30 strokes using a tight pestle in a dounce homog- 
enizer. Very few molecules of SAGA are liberated under the low ionic 
conditions, leaving time for protease inhibitors to bind their targets 
before SAGA is extracted. 

After 20 min of incubation, 300 mM NaCl, 2 mM CaCl, and 150 pl of 
a-amylase solution (MegaZyme) were added. Following another 30 min 
of incubation, debris was precipitated at 33,000g for 38 min. The super- 
natant was collected and 1-2% PEG 20,000 as well as additional 5 mM 
MgCl, were added to precipitate some remaining organelles and mem- 
brane parts by a short centrifugation step at 33,000g for 10 min. The 
increase in magnesium concentration (final 13 mM) proved crucial to 
prevent precipitation of SAGA at this step. The PEG 20,000 concentra- 
tion was then increased to 5.8% and SAGA precipitated in asecond short 
centrifugation step. The pellet was solubilized in a minimal volume and 
avidin was added to block endogenously biotinylated proteins. The 
suspension was incubated with streptavidin beads for 4 hin buffer A 
(40 mM HEPES pH 8.0, 250 mM sodium chloride, 10% sucrose, 2 mM 
MgCl,,2mM DTT) washed 5 times and eluted with buffer A containing 
10 mM biotin. The eluate was concentrated with Millipore Amicon-Ultra 
(50 kDa cut-off) and spun in a 10-30% sucrose gradient with buffer B 
(20 mM HEPES pH 8.0, 150 mM potassium acetate, 2mM DTT, 6mM 
MgCl,) in rotor SW60 (38,300 rpm for 13.5 h). SAGA was fractionated at 
approximately 25% sucrose and concentrated with Amicon-Ultrato~2mg 
ml”. It is important to note that mass spectrometry (Extended Data 
Fig. 7) and SDS-PAGE analysis“ show no cleavage of the Spt7 subunit 
or sub-stoichiometric amounts of Spt8. It is also noteworthy that very 
little endogenous TBP is present in our final SAGA sample. 


Purification of TBP and TFIIA 

Full-length yeast S. cerevisiae TBP was expressed as N-terminal 6xHis- 
GFP fusion protein in Escherichia coli strain BL21(DE3)RIL. Cells were 
grown in LB medium with ampicillin until an OD, pm Of 0.7 was 
reached. Expression was induced with 1 mM IPTG for 3 h at 37 °C. The 
cells were collected by centrifugation and resuspended in lysis buffer 
(20 mM HEPES-KOH (pH7.5), 0.5 M NaCl, 10% glycerol). Cells were 
treated with 1 mg mI lysozyme (Sigma) for 30 min on ice, and then 


lysed by sonication after the addition of cCOmplete, EDTA-free Protease 
Inhibitor Cocktail (Roche) and PMSF toa final concentration of 1mM. 
The resulting lysate was clarified by centrifugation and incubated 
with equilibrated cOmplete His-Tag Purification Resin (Roche) for1h 
at 4 °C. After the beads were collected, they were washed three times 
with lysis buffer supplemented with 5 mM imidazole, and the bound 
protein was eluted with 20 mM HEPES-KOH (pH 7.5), 0.2 M NaCl, 10% 
glycerol supplemented with 250 mM imidazole. The recombinant 
protein was further purified on a Heparine HiTrap 1 ml column (GE 
Healthcare) equilibrated with 20 mM HEPES-KOH (pH 7.5), 0.2 M NaCl, 
20% glycerol and eluted with a gradient from 200 mM to1M NaCl. Frac- 
tions were analysed by Coomassie blue staining, pooled and concen- 
trated in Amicon-Ultra with a 10,000 (10K)-molecular-weight cut-off 
(Millipore) to a final concentration of 3 mg mI“. DTT (5 mM) was added 
to all buffers immediately before use. The yeast S. cerevisiae TFIIA was 
expressed in £. coli, purified from inclusion bodies and reconstituted 
as described earlier*’. 


Gel-shift DNA-binding assays 

DNA probes were obtained by annealing of equimolar amounts of com- 
plementary oligonucleotides (Eurofins Genomics for short DNA, SIGMA 
for long DNA) ata final concentration of 25 uMin 20 mM HEPES-KOH, 
50 mM NaCl by heating the mixture to 100 °C for 5 min and cooling 
slowly down to room temperature. To estimate the quality of annealing, 
the obtained DNA fragments were loaded on a12% polyacrylamide gel 
and analysed using a Typhoon FLA9500 Imager (GE Healthcare) to fol- 
low CyS fluorescence signal. No Cy5-labelled single-stranded DNA was 
detected (Extended Data Fig. 7c). To further ascertain that we removed 
all effects of single-stranded DNA, all DNA samples were treated with 
Exonuclease I (NEB). Binding assays (10 pl) contained 3 mM Tris-HCl 
(pH 8.0), 6 mM HEPES-KOH (pH 8.0), 45 mM KAc, 20-42 mM KCI, 5 
mM MgCL,, 3% glycerol, 6% sucrose, DNA probe (Cy5-labelled), 0.01% 
NP4O, 25 pg ml! bovine serum albumin (BSA) (NEB), 1 mM DTT and 
proteins as indicated. TBP was incubated first with a fourfold excess 
of SAGA complex for Ihat 4 °C. Under these conditions almost all TBP 
was bound to SAGA. Then DNA and TFIIA were added, and the reaction 
mixtures were incubated for 20 min at room temperature before they 
were loaded onto agarose-polyacrylamide native gels (1% agarose, 1.3% 
acrylamide with 1x Tris-glycine buffer, 5 mM MgCl, in the gel and run- 
ning buffer). The final reaction mix contained 0.075 tM GFP-TBP, 0.3 
LLM SAGA, 0.075 1M DNA and 0.15 pM TFIIA complex. Binding assays 
with long (100-bp) DNA fragments were done under similar condi- 
tions except that a threefold excess (with respect to SAGA) of long 
non-labelled TATA-less DNA was also added because SAGA was shown 
to bind long DNA non-specifically**. 

The gel-shift experiments were repeated several times, all show- 
ing consistent results. Gels were analysed using a Typhoon FLA9500 
Imager to follow CyS fluorescence signal. TBP-DNA and TBP-TFIIA- 
DNA migrate similarly in native agarose-polyacrylamide gels due to 
the limited resolving power of these gels for relatively low-molecu- 
lar-weight complexes. We ascertained the composition of the shifted 
bands by mass spectrometry (Extended Data Fig. 7d). Sequences of 
DNA molecules used: ‘TATA’ (yeast CYCI TATA), 5’-CyS-TGCTCTG- 
TATGTATATAAAACTCTTG-3’; ‘TATA-like’ (CYC1 TATA with two muta- 
tions decreasing the affinity to TBP), 5’-CyS-TGCTCTGTATGTA GAGAA 
AACTCTTIG-3’; ‘TATA-less’ (non-specific DNA, CYCI UAS), 5’-Cy5- 
GGCCGGGGTTTACGGACGATGGCAG-3’; ‘TATA,’ (Pichia pastoris pGAP 
promoter fragment), TGTCATGAGAT TAT TGGAAACCACCAGAAT 
CGAATATAAAAGGCGAACACCTT TCCCAAT TT TGGTT TCTCCTGACCC 
AAAGACTTTAAATTTAATTTA; ‘TATA-like,’ (TATA, with two mutations 
decreasing the affinity to TBP), TGTCATGAGAT TAT TGGAAACCACC 
AGAATCGAAGAGAAAAGGCGAACACCTT TCCCAATT TTGGTTTCTCCT 
GACCCAAAGACTTTAAATTTAATTTA; ‘TATA-less,’ (non-specific DNA), 
GACTCATGTCATGAGATCATTGGACACCACCAGAATCGCGTATCGAAGG 
CGAACACCTGTCTCACGTCTGGTGTCTCCTGACGCACAGACTTCGAA 
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CGTA. The position of the TATA sequence is indicated by bold letters 
and the two mismatches in the TATA-less sequences are underlined. 


Pull-down DNA-binding assays 

Binding assays (20 pl) contained 3 mM Tris-HCl (pH 8.0), 5 mM HEPES- 
KOH (pH 8.0), 35 mM KAc, 46 mM KCI, 5 mM MgCL,, 3% glycerol, 6% 
sucrose, 0.01% NP40, 25 pg mI BSA (NEB), 1 mMDTT, proteins and DNA 
probe (Cy5-labelled ‘TATA’ DNA) as indicated. TBP was incubated first 
with a fourfold excess of SAGA complex for 1h at 4 °C. Then DNA and 
TFIIA were added (as indicated), and the reaction mixtures were incu- 
bated for 20 min at room temperature. The final reaction mix contained 
0.075 1M GFP-TBP, 0.3 LM SAGA (with Sgf11 subunit fused to mCherry), 
0.075 uM Cy5-DNA and 0.15 tM TFIIA complex. Ten microlitres of 
streptavidin-sepharose beads (equilibrated with the binding buffer) 
were added to each reaction mix and tubes were incubated for 2 hat 
4 °Cwith shaking using ThermoMixerC (Eppendorf, shaking frequency 
1,220 rpm). Each reaction mix containing beads was transferred into an 
Ultrafree-MC centrifugal filter unit and centrifuged for 5 min at 12,000g 
at 4 °C. Beads with retained protein or DNA-protein complexes and the 
flowthrough with unbound material were analysed using a PHERAstar 
Plus microplate reader (BMG LABTECH) to quantify the fluorescence 
signal of GFP (corresponding to TBP), mCherry (corresponding to 
SAGA complex) and Cy5 (corresponding to DNA). Acomplementary 
experiment with the reverse order of incubations was also carried out. 
Hence, TBP was first incubated with Cy5-labelled DNA and TFIIA for 20 
min at room temperature, then a fourfold excess of SAGA complex was 
added and reactions were incubated for 1h at 4 °C. Experiments were 
repeated three to four times. 


Cryo-EM sample preparation and data acquisition 

Concentrated SAGA was incubated overnight on ice with an excess of 
ubiquitin aldehyde (UbAI) at a molar ratio of 1:4. Complete binding 
and inhibition was ascertained by following the activity of SAGA using 
Ubiquitin-AMC (Enzo). SAGA-UbAI complexes were precipitated with 
PEG to remove sucrose“, suspended at a concentration of 1mg mI! 
and incubated overnight on ice with a 5:1 molar excess of full-length 
S. cerevisiae TBP. SAGA-TBP was then diluted to 0.2 mg mI and cross- 
linked with 0.1% glutaraldehyde for 30 min on ice. About 3 pl of sam- 
ple was applied onto a holey carbon grid (Quantifoil R2/2 300 mesh) 
rendered hydrophilic by a 90-s treatment in a Fischione 1070 plasma 
cleaner operating at 30% power with a gas mixture of 80% argon:20% 
oxygen. The grid was blotted for 1s (blot force 8) and flash-frozen in 
liquid ethane using a Vitrobot Mark IV (FEI) at 4 °C and 95-100% humid- 
ity. To verify that the occupancy of GFP-TBP on SAGA was approaching 
100%, we loaded the cross-linked complex ona superose 6 10/300 GL 
column (GE Healthcare) and compared the fluorescent signal at the 
peak fraction with that of cross-linked GFP-TBP of known concentra- 
tions. The concentrations of SAGA were estimated by UV absorbance. 
Images were acquired on a Cs-corrected Titan Krios (Thermo Fisher) 
microscope operating at 300 kV in nanoprobe mode using the serialEM 
software for automated data collection*. Movie frames were recorded 
ona4k~x 4k Gatan K2 summit direct electron detector after aQuantum 
Ls 967 energy filter using a 20-eV slit width in zero-loss mode. Images 
were acquired in super-resolution mode at anominal magnification of 
105,000, which yielded a pixel size of 0.55 A. Forty movie frames were 
recorded at a dose of 1.32 e A“ per frame, corresponding to a total 
dose of 52.8 e- A, but only the last 38 frames were kept for further 
processing. 


Image processing 

Movie frames were aligned, dose-weighted, binned by 2 and averaged 
using Motioncor2* to correct for beam-induced specimen motion and 
to account for radiation damage by applying an exposure-dependent fil- 
ter. Non-weighted movie sums were used for contrast transfer function 
(CTF) estimation with the Gctf" program, while dose-weighted sums 


were used for all subsequent steps of image processing. After manual 
screening, images with poor CTF, particle aggregation or ice contami- 
nation were discarded. Particles were picked using crYOLO (https:// 
doi.org/10.1101/356584). These datasets were analysed in RELION 
3.048, cryoSPARC” and cisTEM”™ according to standard protocols. In 
brief, three rounds of reference-free 2D classification of the individual 
particle images were performed in RELION to remove images, like ice 
contaminations or deformed particles, that failed to enter into high- 
resolution class averages. The selected images were aligned against a 
low-pass-filtered starting model”. Three rounds of 3D classification 
with increasing regularization parameter 7 were performed and classes 
showing high-resolution features were selected for the subsequent 
steps. These images were imported, and refined in cryoSPARC v.2. 
Three-dimensional classification of the entire dataset could not clearly 
separate distinct conformations of the SAGA complex. Therefore, we 
carried out a focused refinement of the separate lobes using the masked 
lobes as references. The Tral lobe was refined witha mask covering the 
Tral densities using the ‘Local refinement’ module. The structure of 
the main lobe was obtained by using signal subtraction, in which the 
densities corresponding to the Tral lobe were subtracted from the 
images. This operation was followed by local refinement with a mask 
covering the main lobe densities. Global resolution estimates were 
determined using the Fourier shell correlation (FSC) = 0.143 criterion 
after a gold-standard refinement. Local resolutions were estimated with 
ResMap”™ and cryoSPARC. Maps to fit TBP, Spt8 and the DUB module 
were obtained after 3D classifications using RELION 3 by applying masks 
covering the region of interest. 

To describe the flexibility of the DUB and HAT modules, the selected 
SAGA images were analysed using the IMAGIC software package”. The 
selected images were aligned against 75 equidistant projections of the 
3D modeland clustered to form1,000 2D classes. Images from classes 
corresponding to two different views were grouped and aligned sepa- 
rately against their corresponding reference. Local classification into 
50 classes was performed for each group by using an adapted mask 
and the best class averages were selected to show the extent of protein 
density variation within each view. 


Model building 

Pairwise alignment was performed with Needle’. Homology models 
for Tral, TBP, Spt8, DUB module, all eight histone folds, Taf6é HEAT 
repeats, Taf5 WD40, Spt20 SEP domain and Taf5 NTD domains were 
generated using I-TASSER™ or Swiss-Model and docked into the maps 
manually. The rigid-body-docking of DUB and Spt8 was guided by the 
overall shape of the local density, which lacked clear secondary struc- 
ture features, probably owing to the peripheral position and flexible 
nature of their connection to the SAGA core. The position of the DUB 
is consistent with previous studies, notably XL/MS* and negatively 
stained particles with GFP-labelled subunits of the module”. As the 
DUB is composed of two lobes with similar overall shape we cannot 
exclude at the current resolution an inverted (upside-down) orienta- 
tion for the DUB. The location and orientation of Spt8 is validated by 
cross-linking mass spectrometry”, specific single-site photo-activated 
cross-linking”® and biochemical data’. 

The other homology models, with the notable exception of TBP, 
for which only a minor tilt of the C-terminal stirrup was applied, were 
rebuilt according to density using Coot® and then extended from 
their N’ and C’ termini until a significant discontinuity in the map was 
encountered. Such discontinuities normally corresponded to sites 
where RaptorX™ and PSIPRED” predictions indicated a large disor- 
dered region. To assign densities that were not clearly connected to 
any of these extended starting models, we considered the local map 
properties including secondary structure features, proteins in the 
vicinity, biochemical data, cross-linking information for surrounding 
domains and, most importantly, the fit between protein side chains 
and the density. These parts of the structure were then built de novo. 


For alarge proportion of the side chains, density was clear and practi- 
cally all densities in the core of SAGA and Tral could be assigned with 
confidence. The single exception to this procedure is the Ada3 domain 
at the docking site of the HAT module, where density showed second- 
ary structure elements but could not resolve individual side chains. 
We cannot therefore exclude errors in amino-acids register for this 
domain. Tral and the main lobe were separately geometry-optimized 
in PHENIX® followed by real-space refinement with secondary struc- 
ture restraints against the corresponding post-processed maps. This 
procedure was repeated after a round of final manual corrections to 
the structure in Coot. All display images were generated using UCSF 
Chimera” and ChimeraX®. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The cryo-EM maps have been deposited in the Electron Microscopy 
Data Bank (EMDB) under accession codes EMD-10438 (SAGA-TBP), 
EMD-10440 (SAGA-TBP, refined for Tral lobe), EMD-10441 (SAGA-TBP, 
refined for main lobe), EMD-10446 (SAGA-TBP, low-pass-filtered), EMD- 
10448 (SAGA-TBP, focused classification on TBP), EMD-10447 (SAGA 
without TBP). The model coordinates for SAGA-TBP were deposited 
in the Protein Data Bank (PDB) under the accession codes 6TB4 and 
6TBM (including Spt8 and DUB). 
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Extended Data Fig. 1| Subunit and domain organization of the yeast SAGA 
complex. a, SAGA subunits are organized into the Tral lobe and the main lobe. 
The main lobe is divided into enzymatic HAT and DUB modules, acentral 
module anda TBP-binding module. Overlaps appear as the central module isa 
scaffold for the assembly of all other modules. Abbreviations used owing to 
space limitations: Taf5 lissencephaly homology (LisH) domain (T5L), Taf6 linker 
domain (N83-S218, T6L), Taf9 C tail (N110-L139, T9T), Sgf73 NTD (M1-K69, ND), 
Sgf73 linker region (I70-H234, S73L), Sgf73 anchoring domain (E235-L277, 
S73A), Spt20 HIT domain interacting with Tral (M325-L340, HIT), Ada3 
anchoring domains (K398-L471, A3A1) and (Y509-E557, A3A2) and 
bromodomain (BRD). The SEP domain found in Spt20 is named after 


Spt20 


Kinase 


S. cerevisiae Shp1, Drosophila melanogaster eyes closed gene (Eyc) and 
vertebrate P47. The FAT repeats are divided into three TRD domains and one 
HEAT-repeats domain (HRD). We defined three bridge-forming domains 
involved in connecting Tral to the central module: the Spt20 bridge (K184- 
K336, S20B), the Taf12 bridge (T430-K502, T12B) and the Spt3 bridge (E268- 
1304, S3B). The histone-fold domains, present in seven subunits, are 
highlighted as yellow boxes. b, Schematic representation and helix 
nomenclature for histone-fold domains. c, Original micrograph of frozen- 
hydrated P. pastoris SAGA-T BP complex. d, Two-dimensional class averages 
showing high-resolution structural features. 
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features. Cryo-EM map and atomic model showing that side chains are clearly 
identified. b, Histone-fold domain of Taf9. c, Part of the Taf5 WD40 domain. 
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Extended Data Fig. 3| Fitting of TBP and Spt8 into the cryo-EM map. 

a, Pairwise alignment of TBP fromP. pastoris and S. cerevisiae. Eight residues 
(out of 180) differ between the two organisms within the conserved C-terminal 
part of TBP (marked bya dotted line). These residues are mapped on the 
structure of cTBP (PDB: 1YTF). None of these residues occur in regions that 
contact SAGA (highlighted in red). b, Pairwise alignment of TFIIA subunits, 
Toaland Toa2, fromP. pastoris and S. cerevisiae. The evolutionary conserved 
and structured domains of TFIIA (marked by dotted lines) show 60% identity 
and 80% similarity between the two organisms. It is worth noting that TBP and 
TFIIA have both been shown to be highly similar even across large evolutionary 
distances. For example, TBP, as well as Toa2, from yeast and human are 
functionally interchangeable in Pol II transcription” © and yeast TFIIA 
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complements a mammalian in vitro transcription system depleted of TFIIA*. 
Hence, TBP and TFIIA from S. cerevisiae, which are easier to overproduce in 
E.coli, are valid substitutes in our experimental system for their homologues 
from the closely related budding yeast P. pastoris.c, Cryo-EM reconstructions 
determined in the presence (green) and in the absence (pink) of TBP. The 
enlarged panel shows the superimposition of both maps and highlights the 
additional density corresponding to TBP. d, Fitting of the cT BP crystal 
structure into the additional density observed in the SAGA-TBP complex. The 
a-loop denotes an a-helix from the linker connecting the two histone folds in 
Spt3.e, f, Fitting of the WD40 repeat of subunit Spt8 into the cryo-EM map of 
the SAGA-TBP complex next to TBP. 


a SAGA TFIID 


Taf6 HEAT 


TafS NTD 


Taf5 WD40 


Taf5 WD40 


Taf6 HEAT 


TafS NTD 


Extended Data Fig. 4 | Comparing the arrangement of common SAGA and 
TFIID subunits. a, The Taf5 NTD and Taf6 HEAT repeats have different 
positions in SAGA and in TFIID. The structures of Taf5 NTD, Taf6 HEAT repeats 
and Taf5 WD40 domain in SAGA (left) and in the twin lobe of TFIID” (right) are 
depicted. For the sake of clarity, all other parts of SAGA and TFIID were 
removed. To compare the position of the Taf5 NTD and Taf6 HEAT repeats 
between SAGA and TFIID, both complexes were aligned to have the same 
orientation of the TafS5 WD40 domain. The top face of the WD40 repeat, which 
istoalarge extent freein TFIID, is nearly completely engaged in SAGA. The 
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positions of the Taf5 NTD and Taf6 HEAT repeats are defined by direct 
interactions of the two domains between themselves, with the WD40 surface, 
and with the a-helix A352-D364 of Taf5. b, SAGA and TFIID share five subunits 
(Taf and the four histone fold-containing subunits Taf6, Taf9, Taf10 and Taf12). 
These subunits are present as one and two copies in SAGA and TFIID, 
respectively. The relative positions of these subunits are depicted in both 
complexes. Note that in TFIID, the TBP-binding lobe was poorly resolved’? 
and therefore the second copy of only the Taf6 HEAT repeats and Taf5 NTD is 
shown. 
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Extended Data Fig. 5 | Interactions between histone-fold pairs in SAGA 
deviate from the nucleosome pattern already at the four-helix bundle 
preceding the weak association between Spt3 and Taf10. The binding 
between the first two histone-fold pairs along the spiral (left), Taf6-Taf9 with 
Taf12-Adal, shows little difference from the nucleosome, possibly reflecting 
the fact that these histone folds have the highest sequence similarity to 


histones. As inthe nucleosome analogue, hydrophobic interactions havea very 
important role inthis binding. By contrast, the four-helix bundle preceding the 
association between Spt3 and the Spt7-Taf10 pair, linking Taf12—Adal to Spt7- 
Taf10, is maintained almost solely by hydrogen and electrostatic bonds. Only 
side chains of residues that participate in the interactions are shown. 
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Extended Data Fig. 6 | Mapping previously isolated mutations in Spt3 and are mapped on the structure of Spt3 and TBP in the SAGA-TBP complex. The 
TBP. Several classes of Spt3 and TBP mutations were shown to interact mutations appear to cluster at the contact sites between TBP and Spt3. 


genetically with respect to transcription. Mutations from two major studies””** 
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Extended Data Fig. 7 | TFIIA-dependent TBP release from SAGA. comparison. d, Summary of mass-spectrometry data for the major shifted DNA 
a, Experimental set-up for pull-down assays that investigate TBP and DNA band (Fig. 4, lane 3) showing that it contains both TBP and TFIIA as its major 
interactions with SBP-tagged SAGA. b, Gel-shift assay as presented in Fig. 4b constituents. The spectral count divided by the length in amino acids (PSM/AA) 
but using a longer (100-bp) Cy5-labelled DNA: TATA-containing promoter of each protein was normalized with respect to PSM/AA of TBP (raw data are 
(TATA,), a TATA-like DNA probe (TATA-like,) anda non-specific DNA probe provided in Supplementary Table 1).e, Summary of mass-spectrometry data of 
(TATA-less,). The asterisk indicates non-specific DNA association with SAGA. purified SAGA. To estimate stoichiometry, PSM/AA was calculated for each 
c, Cy5-labelled DNA was almost exclusively double-stranded. Annealed protein and normalized with respect to PSM/AA of subunit Tral (raw data are 
Cy5-labelled DNA used in gel-shift and pull-down assays was loaded ona provided in Supplementary Table 2). Experiments were repeated three to four 


polyacrylamide 12% gel. Single stranded Cy5-labelled TATADNAwasloadedfor times. 
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Extended Data Fig. 8 | Flexibility and anchoring of the enzymatic modules. 
a, Two-dimensional classes of SAGA side views revealing different positions of 
the enzymatic HAT and DUB modules. Bottom, approximate positions of the 
modules are highlighted in yellow for the HAT module and in blue for the DUB 
module. The HAT module seems more diffuse and flexible than the DUB 
module, whose position fluctuates less. b, Docking of the DUB crystal structure 
(PDB: 6AQR) into the 15 A resolution cryo-EM map reconstructed froma small 
class of particles that probably share a similar position for the DUB.c, Domain 
organization of the Sgf73 subunit. d, The Sgf73 linker tethers DUB to the 
central module. The linker (S73L) connecting the N-terminal end (ND), whichis 
part of the DUB, and the anchoring domain (S73A) embedded in the central 
module, includes a SCA7 domain but additionally includes roughly 90 amino 
acids predicted to be unstructured. All residues that cross-link® to the Sgf73 
linker can be mapped on the surface of SAGA. These residues are depicted with 
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light-blue spheres. A putative path for the 164-residue-long linker is delineated 
(dashed) according to the cross-linking sites and scarce traces of the linker 
found inthe cryo-EM map. The top insert shows the density that forms contact 
between DUB and the central module. This contact point is located next to 

the Spt20 SEP and Taf5 LisH domains and is the only trace of the flexible Sgf73 
linker in our maps. e, Two helical domains (ochre) anchor the HAT module at the 
surface of the Taf6 HEAT repeats (red). f, These helical domains are assigned to 
Ada3. Domain-deletion analysis identified two domains in Ada3 and onein 
Ada2 that serve to dock HAT on SAGA and whose deletion results in the release 
of intact HAT from SAGA*. Fit between the maps and secondary structure 
predictions (PSIPRED 3.3) or 3D structure of the Ada3 and Ada2 domains is 
presented in addition to available cross-linking data between these domains 
and the Taf6 HEAT repeats. 
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Extended Data Fig. 9 | Protein loops and tails contribute to an intricate 
network of interactions and to bridges between the central module and 
Tral.a, Bridge established by the loop connecting the second and third helices 
of the cSpt3 histone fold (brown). b, The loop between helices a2 and «3 of the 
cSpt3 histone fold inserts between two helices of Tral. Aromatic stacking 
interactions are formed between Spt3 F294 of this loop and F24.04 and F2422 of 
Tral. The map density for these three phenylalanine residues is depicted.c, The 


C-terminal tail of Taf9 (R101-L139) and the short preceding loop illustrate the 
importance of regions lacking secondary structure elements in SAGA. These 
parts of Taf9 interact with many subunits and domains, including the Spt3 C’ 
tail, Spt20 bridge domain, Taf12 histone fold, Adal, Sgf73 anchoring domain, 
Taf6 histone fold, Spt7 histone fold, Taf5 WD40, Taf5 NTD and Taf10 histone 
fold. 


Extended Data Table 1 | Cryo-EM data collection, refinement and validation statistics 


SAGA SAGA-TBP 
EMD-10447 EMD-10438 (overall) 
EMD-10440 (Tra1) 
EMD-10441 (main lobe) 
Data collection and 


processing 
Magnification 105000 105000 
Voltage (kV) 300 300 
Electron exposure (e/A’) 52.8 52.8 
Defocus range (um) 0.6-4.6 0.5-4.9 
Pixel size (A) 1.09 1.09 
Symmetry imposed C1 C1 
Initial particle images 426839 1068534 
(no.) 
Final particle images (no.) 218114 338850 
Map resolution (A) 
Overall 5.92 3.80 
Main Lobe 4.87 3.52 
Tral Lobe 4.52 3.50 
FSC threshold 0.143 0.143 


Map resolution range (A) 


Refinement 

Model composition 
Non-hydrogen atoms 
Protein residues 
Ligands 

R.m.s. deviations 
Bond lengths (A) 
Bond angles (°) 

Validation 
MolProbity score 
Clashscore 
Poor rotamers (%) 

Ramachandran plot 
Favored (%) 
Allowed (%) 
Disallowed (%) 


*In addition a model including Spt8 and DUB fitted into lower-resolution maps was deposited under PDB accession code 6TBM. 


Modelled 
coordinates 
PDB 6TB4 * 


39144 
5144 


0.008 
1.38 


1.86 
5.53 
0.46 


59.5 
10.4 
0.16 
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Gene transcription by RNA polymerase Il is regulated by activator proteins that recruit 


the coactivator complexes SAGA (Spt-Ada-Gcen5-acetyltransferase)’” and 
transcription factor IID (TFIID)? *. SAGA is required for all regulated transcription’ and 
is conserved among eukaryotes®. SAGA contains four modules’ *: the activator- 
binding Tral module, the core module, the histone acetyltransferase (HAT) module 
and the histone deubiquitination (DUB) module. Previous studies provided partial 
structures“, but the structure of the central core module is unknown. Here we 
present the cryo-electron microscopy structure of SAGA from the yeast 
Saccharomyces cerevisiae and resolve the core module at 3.3 A resolution. The core 
module consists of subunits Taf5, Sgf73 and Spt20, and a histone octamer-like fold. 
The octamer-like fold comprises the heterodimers Taf6—Taf9, Tafl0-Spt7 and Tafl2- 
Adal, and two histone-fold domains in Spt3. Spt3 and the adjacent subunit Spt8 
interact with the TATA box-binding protein (TBP)””> ”. The octamer-like fold and its 
TBP-interacting region are similar in TFIID, whereas Taf5 and the Taf6 HEAT domain 
adopt distinct conformations. Taf12 and Spt20 form flexible connections to the Tral 
module, whereas Sgf73 tethers the DUB module. Binding of anucleosome to SAGA 
displaces the HAT and DUB modules from the core-module surface, allowing the DUB 
module to bind one face of an ubiquitinated nucleosome. 


SAGA contains 19 subunits, distributed over four modules’®. The Tral 
module binds activators’””°, the core module recruits TBP”, the HAT 
module contains the histone H3 acetyltransferase Gcn5' and the DUB 
module comprises a histone H2B deubiquitinase”””’. To determine 
the structure of SAGA, we purified the endogenous complex from S. 
cerevisiae using a strain with a C-terminal TAP-tag on subunit Spt20 
(Methods). Purified SAGA contained all 19 subunits in apparently 
stoichiometric amounts (Extended Data Fig. 1a); we analysed this com- 
plex using cryo-electron microscopy (cryo-EM) and protein cross- 
linking analysis (Methods). We obtained a reconstruction of SAGA at 
an overall resolution of 3.9 A (Extended Data Fig. 1, Supplementary 
Video 1). 

The two large SAGA modules—the Tral and the core module—were 
resolved at 3.4 A and 3.3 A resolution, respectively (Extended Data 
Figs. 1d, e, 2a). We fitted the Tral structure”, built the core module and 
the protein regions connecting the two modules and refined the struc- 
ture in real space (Extended Data Tables 1, 2, Extended Data Fig. 2b). 
The HAT and DUB modules were more flexible and were resolved at9 A 
and 12 A resolution, respectively. The structure of the DUB module” 
could be fitted, but density for the HAT module could not be interpreted 
(Supplementary Video 1). Our protein-protein cross-linking analysis 
and previous cross-linking data’ validated our modelling and assigned 
subunit Spt8 to a remaining density located between the core and Tral 
modules (Supplementary Table 1, Extended Data Fig. 2c, d). 


The structure confirms the overall topology of SAGA with four flex- 
ibly connected modules®*” and reveals the intricate subunit architec- 
ture of the coactivator complex (Fig. 1). The SAGA structure contains 
only one copy ofeach subunit, in contrast to TFIID, which contains two 
copies of several subunits**. The SAGA core module occupies a central 
position and comprises the subunits Taf5, Taf6, Taf9, Taf10, Taf12, Spt3, 
Spt7, Spt20 and Adal. The TBP- interacting subunit Spt8 is flexibly con- 
nected to the core module, as are the HAT and DUB modules (Fig. 1a). 
These three functional SAGA regions are lined up on one side of the 
complex that is predicted to face promoter DNA (Fig. 1a). 

The core module contains a histone octamer-like fold and an adja- 
cent submodule formed by subunits Taf5, Taf6 and Spt20 (Fig. 2). The 
octamer-like fold comprises three pairs of subunits that each contrib- 
ute one histone fold, namely Taf6-Taf9, Tafl0-Spt7 and Taf12-Adal, 
and Spt3, which contributes two histone folds. The presence of an 
octamer-like fold explains early observations of histone-like subunit 
pairs in SAGA”*”®. In contrast to a canonical histone octamer, which 
shows twofold symmetry, the SAGA octamer-like fold is fully asym- 
metric (Extended Data Fig. 3a). 

TafS connects the octamer-like fold to the remainder of the core 
module and is thus important for core-module architecture (Fig. 2b). 
The N-terminal helical domain of Taf5 binds the C-terminal HEAT repeat 
region of Taf6. The C-terminal WD40 B-propeller of TafS docks to the 
histone fold pairs Taf6-Taf9 and Tafl0-Spt7, and binds Spt20, which 
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Fig. 1| Overall structure of SAGA. a, Overview of SAGA structure. Low-pass- 
filtered and high-resolution composite cryo-EM maps of SAGA. The four SAGA 
modules Tral, core, HAT and DUB are indicated. b, High-resolution composite 


contains a SEP (shpl-eyc-p47) domain (Fig. 2c). Spt20 also contains 
an extended loop that forms a wedge between the two Taf5 domains, 
thereby stabilizing them in a defined orientation (Fig. 2b, Extended 
Data Fig. 3b). The Lisl homology motif (LisH) helices of Taf5 interact 
with the SEP domain of Spt20 (Extended Data Fig. 3c). Taf6 contrib- 
utes one B-strand to the Taf5 propeller, suggesting that TafS and Taf6 
form an obligate heterodimer (Fig. 2c). The TafS propeller and the Taf6 
HEAT region both interact witha stretch of Spt7 that extends from the 
octamer-like fold and continues into a bromodomain that is mobile. 
SAGA has long been known to bind TBP”, implying a role in 
recruiting TBP to promoters. According to cross-linking’ and genetic 
data’, SAGA interacts with TBP via the Spt3 and Spt8 subunits. Spt3 and 
Spt8 occupy adjacent locations at the edge of the octamer-like fold 
(Figs. 1, 3a). SAGA and TFIID share the subunits Taf5, Taf6, Taf9, TaflO 
and Taf12’*. TFIID consists of three lobes—A, B and C—and has been 
structurally defined?*. Lobe A contains an octamer-like fold that resem- 
bles the fold observed in SAGA. However, Spt7 and Adal are replaced 
by TFIID subunits Taf3 and Taf4, respectively, and the two histone-fold 
domains of Spt3 are replaced by the histone fold pair Taf11-Taf13 in 


a TIR b 


415 486 268 350 
Tatt2 [JF 539 Adat ("ep 488 


449 529 975 1047 


Spt7 1332 


87 196 8 70 211 312 HF 
Taf10 206 spt3 337 
13 77 224 467 


30 101 


Tafé | Heat _ MS 157 


LisH149 281 455 798 
Taf [| Eine wp40 798 


Wedge 


135 224 TIR 
sptzo [sm 11] 604 


Anchor helices 
75 98 218 286 


cryo-EM map reveals Tral and core modules. The subunit colour code is used 
throughout. c, d, Two views of the SAGA structure displayed as a ribbon model. 


TFIID (Fig. 3b). Despite these differences, the octamer-like folds in SAGA 
and TFIID bind TBP at the same relative position (Fig. 3a, b). 

We generated a model of the SAGA-TBP complex by superposing the 
TBP-containing TFIID structure lobe A onto the SAGA core structure 
(Extended Data Fig. 3d). The model is consistent with TBP bridging 
between SAGA subunits Spt3 and Spt8. In TFIID, the TFIID-specific 
subunit Tafl1 also contributes to TBP binding, and this may explain 
an apparently higher affinity of TFIID for TBP compared with SAGA’. 
TFIID lobe B contains a hexamer of histone-fold domains that lacks 
the Taf11-Taf13 pair and does not bind TBP, but otherwise resembles 
its counterpart in SAGA (Fig. 3c). 

Structural comparisons also show how SAGA and TFIID form distinct 
structures despite sharing five subunits (Fig. 3). The shared subunits 
Taf5 and Taf6 have different structures in the two complexes. In TFIID, 
the TafS N-terminal domain docks to the octamer-like fold and is sta- 
bilized by the TFIID-specific subunit Taf4. In SAGA, Taf4 is absent and 
subunit Adal occupies its position. The TafS N-terminal domain occu- 
pies a position that is distant from the octamer-like fold, is stabilized 
by the SAGA-specific subunit Spt20 and contacts the Taf6 HEAT repeat 


Sof73 657 


Fig.2|SAGA core-module structure. a, Subunit domain architecture. Residues 
at domain boundaries are indicated. BD, bromodomain; HF, histone fold; HEAT, 
HEAT repeat domain; NTD, N-terminal domain; WD40, WD40 £-propeller 
domain; SEP, shp1-eyc-p47 domain; ZF, zinc finger domain; SCA7, SCA7 
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Octamer-like fold 


domain. Underlines indicate regions included inthe structure. b, Ribbon 
model showing subunit arrangement and interactions. View and colour code as 
in Fig. 1.c, The Taf5 WD40 propeller domain interacts with six other SAGA 
subunits. 


SAGA 
Core module 


TFIID 
Lobe A 


TBP binding 
“Spts” 


Fig. 3| Comparison of the SAGA core module with TFIID. a, SAGA core module 
with subunits that are shared with TFIID in colour. A magenta dot depicts lysine 
residue K190 of Spt3 that was cross-linked to TBP’’ and is located in the loop 
between the two histone folds of Spt3 (dashed magenta line). b, Comparison 
with TBP-bound TFIID lobe A? shows that TBP binds to the same relative 


domain, which also adoptsa different position and structure compared 
to TFIID. Thus, incorporation of Adal or Taf4 into the octamer-like fold 
may trigger assembly of either SAGA or TFIID, respectively. 

Our results also show that the core module forms flexible connec- 
tions to the other modules of SAGA. The core subunits Taf12 and Spt20 
contain Tral-interacting regions (TIRs) that tether the Tral module. 
The Taf12 TIR (residues 353-410) meanders through a narrow surface 
groove formed by the TPR repeats of the FAT domain in Tral, whereas 
Spt20 contains two TIRs (Extended Data Fig. 4a, b, d). TIR1 (residues 
398-416) forms a latch that retains the Taf12 TIR, and TIR2 (residues 
474-488) forms a helix that binds Tral (Extended Data Fig. 4a, b, d). 
These interactions are consistent with the known dissociation of Tral 
on Spt20 deletion”’. Furthermore, subunit Sgf73 connects the core 
to the DUB module (Extended Data Fig. 4c, d). Whereas the central 
Sgf73 residues 353-437 are part of the core module (and designated 
‘anchor helices’) and form interactions with Spt20, Adal, Taf12, Taf6 
and Taf9, approximately 100 N-terminal residues are part of the DUB 
module” ’. Consistent with this, a Sgf73 region overlapping with the 
anchor helices is required to retain Sgf73 in SAGA”. 

We next investigated how SAGA binds its nucleosome substrate. 
Modelling anucleosome onto the DUB module with the use of the DUB- 
nucleosome structure” resulted ina clash of the nucleosome with the 
core module. Therefore, SAGA needs to change conformation to bind 
the nucleosome. To investigate this, we prepared nucleosomes that 
were ubiquitinated at histone H2B residue K120 (corresponding to K123 
of yeast H2B) and trimethylated at histone H3 residue K4 (Methods). 
We then formed a SAGA-nucleosome complex, and subjected this to 
cryo-EM analysis (Extended Data Fig. 5). 

The cryo-EM data revealed the DUB module bound to the modified 
nucleosomeat 3.7 A resolution (Fig. 4a, Extended Data Fig. 5d-f). The 
obtained structure of the DUB moduleis virtually identical to the known 
structure of the isolated DUB module”. The DUB module binds to one 
face of the nucleosome ina way that is identical to that observed in the 
isolated DUB-nucleosome complex, although DUB modules bound to 
both faces of the nucleosome in this structure” (Extended Data Fig. 5g). 
Further data processing resolved the Tral module at 4.2 A resolution, 
whereas the core module showed low resolution, and the HAT module 


position with respect to the histone-like fold in SAGA and TFIID. The histone- 
like folds are similar but differ in their subunit composition. c, Comparison with 
TFIID lobe B? reveals different structures of Taf5 and Taf6 that are due to 
complex-specific subunits. 


was invisible, suggesting that it became flexible on nucleosome binding 
(Extended Data Fig. 4d). 

Comparison of low-pass-filtered maps shows that nucleosome bind- 
ing displaces the HAT and DUB modules from the SAGA core module 
(Extended Data Fig. Sh). This is probably important for SAGA to fulfil its 
different functions during transcription activation when itis recruited 
by an activator to the promoter (Fig. 4b). Whereas the HAT and DUB 
modules would deubiquitinate and acetylate a promoter-bound nucleo- 
some around or downstream of the transcription start site (TSS), the 
core module and Spt8 recruit TBP to the promoter upstream of the TSS. 
Flexibility between the modules would allow SAGA to bridge between 
promoter regions and to accommodate changes in their distance at 
different promoters. 

Finally, our results have implications for understanding the structure 
and function of related coactivators. The yeast complex SLIK is identical 
to SAGA but contains a C-terminally truncated version of Spt7 and lacks 
Spt8*°*. In our SAGA structure, the Spt7 C-terminal region protrudes 
towards Spt8, suggesting that it contacts Spt8 and explaining why 
SLIK lacks Spt8. SAGA is highly conserved in human cells and contains 


Closed 


BP % 
{jo Activator 
Open 


H3 tail 


Nucleosome Nucleosome 


Fig. 4| Nucleosome binding induces changes in SAGA. a, Structure of DUB- 
nucleosome complex within SAGA. b, Model showing changes in SAGA module 
orientation on nucleosome and promoter binding. 
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counterparts of all yeast SAGA subunits except Spt8’* (Extended Data 
Table 3). Thus, our yeast SAGA structure is a good model for yeast SLIK 
and human SAGA. In conclusion, the structure of SAGA integrates avail- 
able data, reveals differences to TFIID and provides a framework for 
studying the mechanisms used by this multifunctional coactivator to 
regulate transcription. 
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Methods 


Purification of endogenous SAGA 

S. cerevisiae strain CBO10 (MATa pep4::HIS3, prb1::LEU2, prcl::HISG, 
canl, ade2, trp], ura3, his3, leu2-3,112) with a C-terminal TAP tag at Spt20 
was grown ina 200-I fermenter (INFORS-HT) with 100 | YPD medium 
overnight and collected at OD¢o0 = 5. Cell pellets were resuspended in 
lysis buffer (30 mM HEPES pH 7.5, 300 mM NaCl, 1.5 mM MgCl,, 0.05% 
NP40,1mMDTT, 0.284 pg mI leupeptin, 1.37 pg mI pepstatin A, 0.17 
mg ml‘ PMSF, 0.33 mg ml benzamidine) and frozen in liquid nitro- 
gen. Frozen yeast cell beads were milled to powder using a cryogenic 
grinder (Spex sample prep 6875D). The lysed yeast powder was thawed 
and mixed with half the volume of lysis buffer. Lysates were cleared by 
centrifugation (4,000g, 4 °C, 20 min and 235,000g, 4 °C, 60 min). The 
purification was performed as described’’, with several modifications. 
In brief, the supernatant was incubated with IgG Sepharose-6 Fast Flow 
resin (GE Healthcare) at 4 °C for3 h, the resin was washed with 5 column 
volumes of lysis buffer followed by 5 column volumes of TEV cleavage 
buffer (30 mM HEPES pH 7.5, 150 mM NaCl, 1.5 mM MgCL,, 0.05% NP40, 
1mM DTT and 0.5 mM EDTA) and then resuspended in 5 ml of the TEV 
cleavage buffer. TEV cleavage was performed by incubating with His,— 
TEV protease for 16 hat 4 °C. The eluate was loaded onto a1-ml HiTrap 
Qcolumn (GE Healthcare) and eluted witha gradient using as high salt 
buffer 30 mM HEPES pH 7.5, 1M NaCl, 1.5 mM MgCl, 1 mM DTT. Peak 
fractions were concentrated to approximately 1 mg mI. 


Preparation of modified nucleosomes 

To generate the K120-ubiquitinylated histone H2B, we introduced a 
lysine-to-cysteine mutation (K120C) into the Xenopus H2B sequence 
and aglycine-to-cysteine mutation (G76C) to ubiquitin by site-directed 
mutagenesis. The dichloroacetone cross-link was formed between 
ubiquitin and H2B-K120 as described”, with minor changes. In brief, 
100 tM H2B-K120C and 100 pM His,-Ub(G76C) proteins were incu- 
bated at 50 °C in reaction buffer (SO mM borate pH 8.1, 1 mM tris(2- 
carboxyethyl) phosphine (TCEP)) for1hto reduce cysteines, and were 
then cooled on ice for 1h. Dimethyl formamide (DMF) dissolved in 
dichloroacetone was added to the solution to a final concentration 
of 100 uM and incubated on ice for 1h. The reaction was quenched 
with 50 mM B-mercaptoethanol, frozen and lyophilized. The resulting 
product mixture was resuspended in Ni-U buffer (20 mM HEPES pH7.5, 
500 mM NaCl, 6 Murea, 2mM B-mercaptoethanol, 20 mM imidazole) 
and applied to a HisTrap HP 5 ml column (GE Healthcare). The bound 
proteins were eluted with Ni buffer supplemented with 150 mM imi- 
dazole, and dialysed into TEV cleavage buffer. After TEV cleavage for 
16 hat 4 °C, the product was dialysed into Ni-U buffer and reapplied 
toaHisTrap HP5 ml column to remove uncleaved products. The flow- 
through from the column was applied to a HiTrap SP 5 ml column (GE 
Healthcare) and eluted with a gradient of Ni-U buffer with 1M NaCl. 
Peak fractions were pooled and dialysed to water containing 5 mM 
6-mercaptoethanol, frozen and lyophilized. 

H3K4me3 binding by the Sgf29 Tudor domain is required for chro- 
matin targeting and histone H3 acetylation of SAGA™. To generate 
the K4-trimethylated histone H3 variant, a single lysine-to-cysteine 
mutation (K4C) was introduced into the H3 sequence by site-directed 
mutagenesis. Cysteine-engineered histone H3 K4C protein was 
alkylated as described*. In brief, purified protein was reduced with 
DTT before addition of a50-fold molar excess of trimethylammonium 
bromide (Sigma 117196-25G). The reaction mixture was incubated 
for 4h at 50 °C before quenching with 5 mM B-mercaptoethanol. 
The modified protein was desalted using a PD-10 desalting column 
(GE Healthcare) pre-equilibrated in water supplemented with 2 mM 
B-mercaptoethanol and lyophilized. Successful alkylation was con- 
firmed by MALDI-TOF mass spectrometry. The Widom 601145 bp DNA 
was purified as described from the pUC19 8 x 145 bp 601-sequence 
plasmid using the restriction enzyme EcoRV to digest the DNA into 


fragments**. Nucleosomes were reconstituted with modified histones 
and the Widom 601 DNA as described™. 


Cryo-EM sample preparation 

Purified SAGA (or SAGA mixed with the modified nucleosome ata molar 
ratio of 1:2) was incubated with 3 mM BS3 for 1h onice, and quenched 
for 10 min using 10 mM Tris-HCI pH 7.5, 2mM lysine and 8 mM aspartate. 
Quenched samples were applied to a15—40% sucrose gradient in dialysis 
buffer (20 mM HEPES pH 7.5, 150 mM NaCl, 1.5 mM MgCl,, 1 mM TCEP, 
2% glycerol), and ultracentrifuged at 32,000 rpm (SW60 rotor) for 16h 
at 4 °C. Gradients were fractionated in 200 pl and analysed with native 
PAGE. The gels were stained with Syber Gold (Invitrogen) and Coomas- 
sie brilliant blue. Peak fractions containing SAGA or the SAGA-nucleo- 
some complex were dialysed overnight, concentrated to approximately 
0.2 mg mI" and used for grid preparation. Two microlitres of sample was 
applied to glow-discharged UltrAuFoil 2/2 grids (Quantifoil) on each 
side of the grid. After incubation for 10 s, the sample was blotted for 4 
sand vitrified by plunging into liquid ethane using a Vitrobot Mark IV 
(FEI Company) operated at 4 °C and 100% humidity. 


Cryo-EM data collection and image processing 

Cryo-EM data of the SAGA and SAGA-NCP were acquired on a FEI 
Titan Krios transmission electron microscope operated at 300 keV, 
equipped with a K2 summit direct detector and a GIF quantum energy 
filter (Gatan). Automated data acquisition was carried out using EPU 
software (FEI) at anominal magnification of 130,000 or 105,000x, 
resulting in calibrated pixel sizes of -1.05 A and -1.35 A for SAGA and 
the SAGA-nucleosome complex, respectively. Movies of 40 frames 
were collected in counting mode over 9 s with a defocus range of 
1.25-2.75 um. The dose rate was 4.7 e A?s resulting in 1.06 e A? per 
frame for SAGA, and 4.9 e A*s*resultingin1.10 e A? per frame forthe 
SAGA-nucleosome complex, respectively. A total of 4,697 and 4,866 
movies were collected for SAGA and the SAGA-nucleosome complex, 
respectively. Movie stacks were motion-corrected, CTF-estimated and 
dose-weighted using Warp®. 

Particles of the SAGA data were auto-picked by Warp, yielding 
250,368 particle images. Image processing was performed with RELION 
3.0.5°°. Particles were extracted using a box size of 4007 pixels, and 
normalized. Reference-free 2D classification was performed to screen 
for good particles in the dataset. An ab initio model generated from 
cryoSPARC” was used as an initial reference for subsequent 3D clas- 
sification. All classes containing intact SAGA density were combined 
(107,759 particles) and used for a global 3D refinement resulting ina 
map at 4.7 A resolution. To improve the map for the core module of 
SAGA, focused 3D classification without image alignment was per- 
formed using a mask around the core module. The class that showed 
the best density for the core module was subjected to another round 
of 3D refinement resulting in an overall resolution of 4.1A. Focused 
refinement further improved the resolution to 3.4 Aand 3.3 A for Tral 
and the core module, respectively. Post-processing of refined recon- 
structions was performed using automatic B-factor determination 
in RELION and reported resolutions are based on the gold-standard 
Fourier shell correlation (FSC) 0.143 criterion (B-factors of -107 A? and 
-91 A? for the Tral and the core module, respectively). Local resolution 
estimates were obtained using the built-in local resolution estimation 
tool of RELION using the estimated B-factors. 

For the SAGA-nucleosome complex sample, 579,759 particles were 
auto-picked by Warp. As the DUB-nucleosome and the remaining parts 
of SAGA were not present together during the classification steps, parti- 
cles of SAGA in the nucleosome-bound state and the DUB-nucleosome 
were processed separately. Otherwise, the processing procedure was 
thesameas that for SAGA. However, focused 3D classification without 
alignment did not yield good core-module particles from the SAGA- 
nucleosome dataset. A reconstruction at 6.1 A overall resolution was 
obtained from 86,910 particles of SAGA in the nucleosome-bound state. 


Article 


Focused refinement further improved the resolution to 4.2 A for the 
Tral lobe. For the DUB-nucleosome complex, areconstruction at 3.7 A 
overall resolution was obtained from 113,856 particles. Post-processing 
of the refined reconstructions was performed using automated B-factor 
determination in RELION and reported resolutions are based on the 
gold-standard FSC 0.143 criterion (B-factors of -149 A? and -115 A? for 
the Tral lobe and the DUB-nucleosome, respectively). Local resolution 
estimates were obtained using the built-in local resolution estimation 
tool of RELION using the previously estimated B-factors. 


Cross-linking and mass spectrometry 
Samples for cross-linking mass spectrometry were performed essen- 
tially inthe same way as those for cryo-EM. Cross-linked samples were 
purified by sucrose gradient centrifugation and fractions containing 
fully assembled complexes were pooled for mass spectrometry sam- 
ple preparation. For in-solution digest, urea buffer (8 M urea, 50 mM 
NH,HCO, pH 8) was added to pooled fractions toa final concentration 
of1Murea. Samples were reduced with5 mM DTT (in50 mMNH,HCO, 
pH 8) for 30 min at 37 °C, 300 rpm followed by alkylation with 20 mM 
iodoacetamide (in 50 mM NH,HCO, pH 8) for 30 min at 37 °C, 300 rpm, 
inthe dark. The reaction was quenched by addition of 5 mM DTT (in5O 
mM NH,HCO, pH 8). Trypsin digest (Promega, V5111) was performed 
overnight at 37 °C with 1:20 mass ratio (trypsin:complex). Tryptic 
peptides were desalted using C18 spin columns (Harvard Apparatus 
74-4601), lyophilized and dissolved in 30% (v/v) acetonitrile, 0.1% (v/v) 
trifluoroacetic acid. The peptide mixture was separated on a Superdex 
Peptide 3.2/300 (GE Healthcare) column run at 50 pl min™ with 30% 
(v/v) acetonitrile, 0.1% (v/v) trifluoroacetic acid. Cross-linked species 
are enriched by size-exclusion chromatography based on their higher 
molecular weight compared to linear peptides. Therefore, 50-l frac- 
tions were collected from 1.0 ml post-injection. Fractions from 1.0-1.6 
ml post-injection were dried in a speedvac and dissolved in 5% (v/v) 
acetonitrile, 0.05% (v/v) trifluoroacetic acid and analysed by liquid 
chromatography-tandem mass spectrometry (LC-MS/MS). 
LC-MS/MS analyses were performed ona Q Exactive HF-X hybrid 
quadrupole-orbitrap mass spectrometer (Thermo Scientific) coupled 
toa Dionex Ultimate 3000 RSLCnano system. Peptides were loaded on 
a Pepmap 300 C18 column (Thermo Fisher) at a flow rate of 10 pl min™ 
in buffer A (0.1% (v/v) formic acid) and washed for 3 min with buffer A. 
The sample was separated on an in-house packed C18 column (30 cm; 
ReproSil-Pur 120 A, 1.9 xm, C18-AQ; inner diameter, 75 ,1m) at a flow 
rate of 300 nl min”. Sample separation was performed over 60 min (in- 
solution digest) or 120 min (in-gel digest) using a buffer system consist- 
ing of 0.1% (v/v) formic acid (buffer A) and 80% (v/v) acetonitrile, 0.08% 
(v/v) formic acid (buffer B). The main column was equilibrated with 5% 
B, followed by sample application and a wash with 5% B. Peptides were 
eluted by a linear gradient from 15-48% B or 20-50% B. The gradient was 
followed by a wash step at 95% B and re-equilibration at 5% B. Eluting 
peptides were analysed in positive mode using a data-dependent top-30 
acquisition methods. MS1 and MS2 resolution were set to 120,000 and 
30,000 full width at half maximum, respectively. Precursors selected for 
MS2 were fragmented using 30% normalized, higher-energy collision- 
induced dissociation (HCD) fragmentation. Allowed charge states of 
selected precursors were +3 to +7. Further MS/MS parameters were set as 
follows: isolation width, 1.4 m/z; dynamic exclusion, 10 s; max. injection 
time (MS1/MS2), 60 ms/200 ms. The lock mass option (m/z 445.12002) 
was used for internal calibration. All measurements were performedin 
duplicates. The .raw files of all replicates were searched by the software 
pLink 1, v.2.3.1°8 and pLink 2” against a customized protein database 
containing the expressed proteins. Protein-protein cross-links were 
filtered with 1% FDR and plotted using xVis*°. 


Model building 
The structure of the core module was built by first placing the known 
structure of the Taf5-Taf6-Taf9 trimer (PDB ID: 6F3T) into the density 


by rigid-body fitting in Chimera. Adjustments were made to the protein 
sequence in Coot"; insertions and deletions were manually built accord- 
ing to the density. The histone-fold domains of Taf10, Spt7, Taf12, Adal 
and Spt3 and extensions from them were manually built. The structure 
of the TafS NTD (PDB ID: 2J49) and Taf6 HEAT domain (PDB ID: 4ATG) 
were placed into the density and adjusted in Coot. The remaining parts 
were built manually. Secondary structure predictions from PSIPRED 
were used to assist de novo modelling. a-helices were generated using 
Coot and manually fitted into the density. Linkers between the helices 
were modelled where clear density was visible. Cross-linking restraints 
and densities from bulky residues suchas Lys, Arg, Phe, Tyr and Trp were 
used to guide modelling. The SEP domain of Spt20 shares structural 
homology with human p47 (PDB: 1SS6), and this structure guided Spt20 
modelling. The structure of the Tral module was built by placing the 
structure of Tral (PDB ID: 50JS) into the density by rigid-body-fittingin 
Chimera, and the TIRs of Taf12 and Spt20 were manually built in Coot 
based on the density and cross-linking restraints. The DUB-nucleo- 
some structure (PDB ID: 4ZUX) was placed into the corresponding 
densities by rigid-body-fitting the DUB module and nucleosome in 
Chimera. All models were subjected to alternating manual adjustment 
and real-space refinement using Coot and PHENIX™, resulting in good 
stereochemistry as assessed by Molprobity*’. Figures were generated 
in PyMOL (Schrédinger, v.2.2.2) and UCSF Chimera (v.1.13). 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The electron density reconstructions and models of the complete 
SAGA complex, the Tral module, the core module, the DUB module- 
nucleosome complex and the nucleosome-bound state of SAGA were 
deposited with the Electron Microscopy Data Bank (accession codes 
EMD-10412, EMD-10413, EMD-10414, EMD-10415 and EMD-10416 respec- 
tively) and with the Protein Data Bank (accession codes 6T9I, 6T9J, 
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Extended Data Table 1| Cryo-EM data collection, refinement and validation statistics 
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Extended Data Table 2 | Modelling of yeast SAGA subunits, domains and regions 
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Extended Data Table 3 | Conservation of SAGA between yeast and human 
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Subunits Subunits similarity* 
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*The HHpred similarity scores are calculated between homologous regions only. 
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In Extended Data Fig. 2 of this Letter, the tyrosinase guide RNA sequence 
in the top panel was incorrectly annotated such that it was shifted 
8 nucleotides to the left to align with a neighbouring protospacer 
adjacent motif (PAM) site. The error had no consequence for the 
purpose of communicating the genotypes of offspring of Rosa26-cas9 
and H11-cas9 breeding strategies. The guide RNA sequence was cor- 
rectly presented in the Supplementary Methods and in Supplementary 
Fig. 2. Extended Data Fig. 2 has been corrected online. We would like 


to thank the reader who brought this to our attention. 
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M. Zemp, M. Huss, E. Thibert, N. Eckert, R. McNabb, J. Huber, 
M. Barandun, H. Machguth, S. U. Nussbaumer, I. Gartner-Roer, 
L. Thomson, F. Paul, F. Maussion, S. Kutuzov & J. G. Cogley 


In this Letter, the regional mass changes for Iceland were erroneously 
computed from the temporal variability of the glaciological sample 
from the Low Latitudes (instead of Iceland). Nevertheless, long-term 
mass changes were correctly derived from geodetic values from Iceland. 
Recalculation based onthe correct data results ina larger inter-annual 
variability for Iceland (with a minimum mass balance in the hydrologi- 
cal year 2009/10 instead of 2015/16), a larger mass loss over the full 
observation period from 1961 to 2016 (-3.7 gigatonnes (Gt) yr instead 
of -2.4 Gtyr’), andasmaller ice loss over the last decade from 2006 to 
2016 (-5.3 Gt yr ‘instead of -7.5 Gt yr). At global levels, however, these 
corrections result in changes of less than 1% and are well within the 
error bars. ASaconsequence, this error did not affect any of the conclu- 
sions in the Letter. In Table 1, in the ‘06 Iceland (ISL)’ row, the specific 
mass change (m water equivalent (w.e.) yr“) should have been -0.50 
+ 0.37 (instead of -0.71 + 0.43), and the mass change (Gt yr’) should 
have been —5 + 4 (instead of —8 + 5); in the ‘Total, excl. GRL and ANT’ 
row, the specific mass change should have been —0.55 + 0.04 (instead 
of -0.56 + 0.04), and the mass change should have been —267 +19 
(instead of -270 +19); and in the ‘Global total’ row, the specific mass 
change should have been -0.47 + 0.20 (instead of -0.48 + 0.20), and 
the mass change should have been —332 + 144 (instead of -335 + 144). 
These errors have all been corrected online. For regional comparisons 
with other studies, we provide a new version with corrected data files 
inthe Zenodo repository (https://doi.org/10.5281/zenodo.3557199). 
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Philippe Mauffrey, Nicolas Tchitchek, Vilma Barroca, 
Alexis-Pierre Bemelmans, Virginie Firlej, Yves Allory, 
Paul-Henri Roméo & Claire Magnon 


In this Article, owing to an error during the production process, in 
Fig. 2c, ‘MASHI’ was inadvertently shown as ‘MASTI’ In the main text, 
‘(Fig. 2f, right)’ and ‘(Fig. 2f, left)’ should have been ‘(Fig. 2e, right)’ 
and ‘(Fig. 2e, left)’. In addition, in Fig. 3h, the black lines indicating the 
P-value comparisons were located incorrectly, and should have been 
touching the tops of the blue and red bars. These errors have been 
corrected online. 

Author Virginie Firlej should have been associated with the affilia- 
tion‘U955 (INSERM), Hépital Henri Mondor-Université Paris-Est, Paris, 
France’ instead of ‘Department of Pathology, Hépital Henri-Mondor- 
Université Paris-Est, Paris, France’, and author Yves Allory should have 
been associated with only the affiliation ‘Department of Pathology, 
Institut Curie, Paris, France’, and not with ‘Department of Pathology, 
H6opital Henri-Mondor-Université Paris-Est, Paris, France’. Finally, the 
first name of author Alexis-Pierre Bemelmans was incorrectly shown 
as ‘Alexis’. The original Article has been corrected online. 
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Leading a diverse team requires effective communication and organization. 


FIVE POWER SKILLS FOR 
BECOMING A TEAMLEADER 


Volunteering with an organization can improve communication and 
help you adapt to the unexpected. By Sarah Groover and Ruth Gotian 


any scientists will oversee a team at 

some point in their careers, whether 

it is one or two undergraduates 

doing a summer internship, an 

entire research group, or a depart- 

ment with students, technicians and postdoc- 

toral researchers. Scientists are trained in their 

discipline, but are rarely, if ever, trained inhow 

to manage and mentor trainees. All too often, 

this results ina series of trials and errors that 
are frustrating to both mentor and protégé. 

Based on our respective experiences, we 

have come to recognize how a skilled leader 


can motivate and help trainees to reach their 
full potential. One of us (S.G.) isan immunology 
PhD student at Oklahoma State University in 
Tulsa and volunteers as vice-president of the 
American Physician Scientists Association 


“Identifying and 
developing ‘power skills’ 
can help you tosucceed 
in asupervisory role.” 
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(APSA) in Westford, Massachusetts. The other 
(R.G.) has nearly three decades of experience 
in directing leadership and mentoring devel- 
opment programmes in higher education and 
academic medical centres. 

Identifying and developing ‘power skills’ 
— the crucial abilities that enable a manager 
to connect with people, communicate 
effectively, adapt to the unexpected and 
be open-minded — can help you to succeed 
in a supervisory role. It is never too early 
or too late to develop these competencies. 
Taking on a leadership role in a national 
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Work / Careers 


organization as a student or an early-career 
scientist can be a great way to cultivate 
your leadership abilities. We’ve compiled 
a list of five power skills that we feel are 
integral to the success of a team leader. 


Teamwork. It is essential that you and your 
team have mutual trust and respect. You 
need to motivate and utilize the individual 
strengths of team members. When assigning 
tasks or providing feedback, it is important 
that your members know that their con- 
tributions are valued and that their time is 
appreciated. Serving ona planning commit- 
tee for an organization’s annual conference 
can be a great way to practise good team- 
work. Multiple components are required 
to create a successful event, from branding 
and advertising to fundraising and account- 
ing. Capitalize on your colleagues’ strengths 
and passions, and align tasks accordingly. 


Communication. This is a key component 
to team success. When working with people 
based around the country or internationally, 
it is important to communicate effectively 
so that everyone is on the same page. We’ve 
experienced the negative impacts of poor 
communication when organizing a national 
meeting, ranging from insufficient advertis- 
ing of the event to forgetting to keep records 
of travel-grant winners. The former caused 
the event to have a low turnout, despite the 
excessive amount of time and effort that went 
into the planning. The latter led to confusion 
and embarassment when the award recipient 
wanted to use funding that the organizers were 
unable to provide. The ability to listen to the 
different needs and viewpoints of your team 
is essential. Always ask your co-workers for 
their perspectives, even if they’re not initially 
offered. 


Commitment and reliability. To develop a 
strong reputation as a team leader, it’s better 
to under-promise and over-deliver. Leader- 
ship can include finding responsible people to 
staff committees, which is always a struggle. 
And spreading many responsibilities among 
afew people is challenging, particularly when 
deadlines are looming. It is important to rec- 
ognize that as part ofa team, your actions can 
impede the mission of an organization, lead- 
ing to the late completion of certain initiatives 
or making the group appear unprofessional 
to outside members or partner societies. 
Your disorganization should not be some- 
one else’s emergency. I (S.G.) had planned to 
run for president of APSA after my term as 
vice-president ends. However, the prerequi- 
site president-elect and president positions 
will coincide with my first and second years 
of medical school (I start in the autumn). 
Instead of taking on too much, which might 
sacrifice quality in either commitment, 
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Ruth Gotian runs US leadership programmes in higher education. 


I made the difficult decision to take on a 
more manageable role in the organization. 


Adaptability. Being flexible and innovative 
are key to a leader’s success. When resources 
are lacking or a committee is understaffed, 
emergencies occur. Being able to bounce 
back exemplifies how to stay optimistic and 


“Being flexible and 
innovative are key to 
aleader’s success.” 


focus on the solution. These opportunities 
canalso be used to learn new skills and take on 
assignments. I (S.G.) have always considered 
myself technologically challenged, but when 
Ineeded to make some changes to the APSA 
website, | worked it out. Don’t be afraid to step 
out of your comfort zone and learn something 
new inthe process. 


Open-mindedness and empathy. The most 
productive scientific groups are diverse and 
capitalize on their group’s ability to bring 
different, creative or unconventional solu- 
tions to difficult questions. Volunteering 
with a national organization such as APSA 
has allowed me (S.G.) to work with a diverse 
group of individuals who bring a range of cul- 
tures, ages, backgrounds and perspectives to 
the table. Post-conference surveys highlighted 
that APSA members have varied interests and 
concerns that were not being addressed. We 
took strides to correct these points by cre- 
ating groups and assigning them various 
initiatives, from mentoring to diversity. In 
turn, the teams created diversity summits and 
virtual mentoring programmes. In your own 
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discussions, be sure to include and amplify the 
voices of people who are usually overlooked; 
they have unique insights and opinions, and itis 
the responsibility of the team leader to ensure 
that their voices are heard. At APSA meetings, 
the mostjunior person atthe table speaks first. 


Being proficient in power skills such as the 
ones we have mentioned will equip you to 
effectively manage groups, projects, space 
and money so that everyone is motivated to 
give their best and maintaina healthy mindset 
throughout. Being a leader requires you to be 
reliable, self-aware, open to feedback and trust 
your team. 

Opportunities to develop your leadership 
skills exist. Find an organization (preferably 
one you are already involved in) witha mission 
that aligns with yours. Spend some time 
working in various areas of the organization, 
becoming familiar with its structure, function 
and culture, and don’t be afraid to take ona 
project outside your comfort zone. Use this 
as an opportunity to develop your leadership 
abilities, as well as to network in your area of 
expertise. Being talented in your field is impor- 
tant, but your impact can be much greater if 
you also know how to lead and motivate the 
next generation. 


Sarah Groover is a PhD candidate studying 
cancer epidemiology and immunology at 
Oklahoma State University Center for Health 
Sciences in Tulsa, Oklahoma, and vice- 
president of the American Physician Scientists 
Association in Westford, Massachusetts. Ruth 
Gotian is the assistant dean for mentoring and 
executive director of the Mentoring Academy 
at Weill Cornell Medicine in New York City, 
and chief learning officer in its department of 
anaesthesiology. 
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Sr Z ; 
his is part of the Kerguelen Islands 
in the southern Indian Ocean, one 
of the most remote archipelagos 
in the world. It’s more than 
3,000 kilometres from Madagascar, 

the nearest populated place. When | did 

my PhD work there studying the Antarctic 

fur seal (Arctocephalus gazella), \travelled 

with two other researchers to our field site, 

Cap Noir, by helicopter. None of us, including 

the pilot, had ever been there. 

It was very foggy, but the seal colony that 
we were looking for finally appeared out of 
the mist. As the helicopter hovered, we three 
researchers jumped out with our jackets and 
computers. We were standing in the mist and 
wind, hearing seals’ barks and pups’ bleats, 
smelling the fresh and wild ocean air. 

It was one of those times when I've felt what 
it’s like to live nearly alone among marine 
animals, and have sensed just how little we 
know about these species. It was like living in 
a David Attenborough wildlife documentary. 

I’ve spent my career working in remote 
places, mostly near the Antarctic. By tracking 
marine predators — birds, seals, whales — we 
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now understand much better than we did 
25 years ago how all these species and their 
habitats are connected, from temperate 
waters to polar oceans. We document the 
behaviour of the animals at sea — how far they 
travel and dive, what they eat and how they 
respond to extreme weather events such as 
marine heatwaves. 

On Kerguelen, I walked through the seal 
colony twice a day to do aroll call; watched 
how the pups behaved when their mothers 
were out at sea; and placed satellite tags and 
time-depth recorders onthe mums. 

But there’s nothing more special than 
awild animal coming to investigate you, 
such as when ajuvenile New Zealand sea 
lion (Phocarctos hookeri) took my notebook 
out of my hand. Those moments bolster 
my resolve to conserve these wildernesses. 
Watching a wild animal’s behaviour in its 
natural habitat is what makes me tick. 


Mary-Anne Lea is a behavioural ecologist at 
the Institute for Marine and Antarctic Studies, 
University of Tasmania, in Hobart, Australia. 
Interview by Kendall Powell. 
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Scientists study the microbiome 
to work out its role in health and 
disease. Credit: Antoine Doré 


earenotalonein our bodies. Living inside every personare 
trillions of microorganisms — bacteria, viruses, fungi and 
other life forms that are collectively known as the microbi- 
ome. Various organs have distinct microbial inhabitants, 
but the group that has attracted the most attention in bio- 
medical research is the one in the gut. 

To better grasp the part that gut microbes play in health and disease, 
researchers from around the globe are investigating what makes a ‘good’ 
gut microbiome (see page S6). There are, after all, hundreds of distinct 
bacterial species in the gut — some pathogenic and some beneficial. 
Computational biologist Eran Segal argues that collecting microbiome 
data would allow a ‘deep phenotyping’ approach that could transform 
drug discovery (S19). And the study of some health-promoting probi- 
otic species is yielding biological insights that might promote drug 
development (S9). 

Several diseases are now thought to be influenced by processes inthe 
gut microbiome. Those include cancer (S16), autoimmune disorders 
suchas multiple sclerosis (S12) and autism spectrum disorder (S14). The 
gut microbiome also strongly interacts with certain drugs, including 
some mental-health therapeutics, and influences their effects (S10). 

With evidence mounting of the gut microbiome’s health significance, 
synthetic biologists are looking to engineer the microbiome — both 
at the individual-species level and as an ecosystem — to thwart the 
development of disease (S20). There is also growing public interest in 
howthe gut microbiome can be influenced — often focused on personal 
dietary choices. Microbiologist Peter Turnbaugh reframes this as a 
question not of which foods will benefit our health, but rather what 
medical insights might be gleaned from the interactions between our 
gut microbes and what we eat (S23). 

Much more research is under way on the gut microbiome than can 
be covered in this Outlook, but this supplement gives a taste of the 
breadth of this robust field (S24). 

We are pleased to acknowledge the financial support of 
Danone Nutricia Research in producing this Outlook. As always, Nature 
retains sole responsibility for all editorial content. 
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The hunt forahe 


Despite evidence of the gut microbiome’s role in human health, researchers are 
still working out what shapes the community of microbes. By Michael Eisenstein 


hat does a healthy forest look like? 
A seemingly thriving, verdant 
wilderness can conceal signs of 
pollution, disease or invasive 
species. Only an ecologist can 
spot problems that could jeopardize the long- 
term well-being of the entire ecosystem. 

Microbiome researchers grapple with the 
same problem. Disruptions to the commu- 
nity of microbes living in the human gut can 
contribute to the risk and severity of a host of 
medical conditions. Accordingly, many scien- 
tists have become accomplished bacterial nat- 
uralists, labouring to catalogue the startling 
diversity of these commensal communities. 
Some 500-1,000 bacterial species reside in 
each person’s intestinal tract, alongside an 
undetermined number of viruses, fungi and 
other microbes. 

Rapid advances in DNA sequencing tech- 
nology have accelerated the identification of 
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these bacteria, allowing researchers to create 
‘field guides’ to the species in the human gut. 
“We're starting to get a feeling of who the play- 
ers are,” says Jeroen Raes, a bioinformatician at 
VIB, alife-sciences institute in Ghent, Belgium. 
“But there is still considerable ‘dark matter’” 
Currently, these field guides are of limited 
use in distinguishing a healthy microbiome 
from an unhealthy one. Part of the problem is 
the potentially vast differences between the 
microbiomes of apparently healthy people. 
These differences arise through a complex 
combination of environmental, genetic and 
lifestyle factors. This means that relatively 
subtle differences can have a disproportion- 
ate role in determining whether an individ- 
ual is relatively healthy or at increased risk 
of developing disorders such as diabetes. 
Understanding the clinical implications of 
those differences is also a challenge, given 
the extensive interactions between these 
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microbes, and with their host, as well as the 
conditions in which that individual lives. “One 
person’s healthy microbiome might not be 
healthy in another context — it’s a tricky con- 
cept,” says Ruth Ley, a microbial ecologist at 
the Max Planck Institute for Developmental 
Biology in Tubingen, Germany. 

Researchers suchas Ley are trying to better 
understand the forces that shape the human 
gut microbiome — bothinthe modernera, and 
across evolutionary history. The emerging 
picture indicates that even if there is no one 
healthy microbiome, there are ample oppor- 
tunities for our lifestyle to interfere with the 
proper function of these complex commen- 
sal communities. And to understand how the 
breakdown of these ecosystems drives dis- 
ease, researchers will need to move beyond 
microbial field guides and begin dissecting 
how these species interact with their hosts and 
with each other. 


ILLUSTRATION BY ANTOINE DORE 


A mother’s first gift to her newborn is a 
healthy smattering of microbes. Some are 
passed along through breastfeeding and 
skin-to-skin contact, but many microbes are 
acquired during passage through the birth 
canal. This means that if the baby is delivered 
by caesarean section, they might miss out ona 
valuable bacterial starter kit. Because a child’s 
earliest years generally establish the compo- 
sition of a gut community that will persist 
throughout adulthood, the resulting disrup- 
tions can have serious long-term health con- 
sequences. “As these infants grow, they have 
higher risk of obesity, and of modern plagues 
like diabetes, allergies and asthma,” says Maria 
Gloria Dominguez-Bello, a microbiologist at 
Rutgers University in New Brunswick, NewJer- 
sey. Ina small clinical study, her team found 
that swabbing newborns delivered by c-section 
with fluids from their mother’s birth canal 
could help to mitigate some of the lost micro- 
bial diversity’. Several larger trials are under 
way to assess the longer-term health benefits. 

Environmental exposures early in life also 
strongly affect a child’s microbiome. Susan 
Lynch, a microbiome researcher at the Uni- 
versity of California, San Francisco, has been 
exploring links between environmental fac- 
tors during childhood and the subsequent 
risk of developing allergies and asthma. Her 
findings indicate that new parents shouldn't 
be afraid of a little dirt — or fur. After moni- 
toring a cohort of nearly 1,200 infants, Lynch 
and her colleagues found that a dog might be 
a baby’s best friend when it comes to avoid- 
ing respiratory disorders’. “The only factor 
that discriminated high- from low-risk groups 
was dog ownership,” says Lynch. She says that 
dogs (and, to alesser extent, cats) “increase the 
diversity of bacteria and lower the diversity 
of fungi in the houses where these babies are 
raised”. This finding aligns with other research 
showing that a rural upbringing or growing 
up ona farm might yield a richer gut micro- 
biome that reduces the risk of inflammatory 
respiratory diseases relative to children raised 
in more urban environments. 

At acertain point during childhood, the 
composition of the gut microbiome generally 
stops changing — although precisely when is 
unclear. A study in 2012 surveyed gut microbes 
from individuals in Malawi, Venezuela and the 
United States, and found a striking pattern’. 
“By three, you can no longer tell the babies 
from the adults,” says Dominguez-Bello, who 
was a co-author on the paper. However, she 
notes that there is also evidence that the 
microbiome remains somewhat mutable 
beyond this point. What is clear is that by 
adulthood, this ecosystem reaches a state of 
equilibrium. “It’s very stable,” says Eran Segal, 


a computational biologist at the Weizmann 
Institute of Science in Rehovot, Israel. “We see 
changes, but you will still look mostly similar, 
even over many years.” 

Some of the changes seen in adulthood are 
driven by environment and lifestyle. Ina 2018 
study of 1,046 ethnically diverse adults living 
in Israel, Segal demonstrated microbial dif- 
ferences that had little to do with ethnicity‘. 
“Environmental inputs could account for 
20-25% of the variability in the microbiome,’ 
says Segal. Drugs are an obvious source of 
disruption, and antibiotics — taken either 
deliberately to fight infection or unwittingly 
in processed foods — can profoundly affect the 
microbiota. Even drugs with no clear roleincon- 
trolling bacteria can cause perturbations. Raes 
notes that one major European microbiome 
study was confounded by unexpected effects 
from the diabetes drug metformin’. 

Diet is also a powerful external influence, 
even if the precise mechanisms by which it 
exerts its effects remain unclear. One study 
in 2018 found that immigrants to the United 
States from Thailand experienced a strik- 
ing ‘westernization’ of their gut flora — a 
transformation that could be, at least in part, 
attributed to adopting a US diet®. 


Mismatched to modernity 


The changes observed in immigrants from 
Thailand were accompanied by increased 
risk of obesity. The study did not establish a 
causal link, but the results are consistent with 
an increasingly popular hypothesis that urban- 
ization — and modern life in general — might be 
highly disruptive to the tight-knit relationship 
that has evolved between humans and their 
microbes. “We have made the assumption that 
the Western microbiome of a healthy person 
isa healthy microbiome,’ says microbiologist 
Justin Sonnenburg at Stanford University 
in California. Instead, he and others think 
that the intersection of diet, antimicrobial 
precautions and general hygiene leads toa 
culling of the gut community, and that this 
disruption might contribute to the elevated 
risk of chronic disease in industrialized soci- 
eties. “This combination of Western diet and 
depleted microbiome has likely led to a sim- 
mering inflammatory state,” Sonnenburg says. 

Several studies have identified a stark differ- 
ence between the microbiota of urban popula- 
tions and those of Indigenous populations that 
lead traditional agrarian or hunter-gatherer 
lifestyles, which more closely resemble those 
of our early ancestors. These differences seem 
to be attributable mainly to loss of bacterial 
diversity, which might be linked to the lack of 
fibre in Western diets. The Hadza, a popula- 
tion of hunter-gatherers living in Tanzania, 
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eat 100-150 grams of dietary fibre per day, 
Sonnenburg says — ten times as much as a 
typical personinthe United States. Asaresult, 
fibre-digesting bacteria such as those belong- 
ing to the genus Prevotella, which can form up 
to 60% of the gut microbiome in non-Western 
populations, are much less abundant in the 
United States. Sonnenburg’s team has demon- 
strated how these changes can become firmly 
entrenched ina population over the course of 
just a few generations’. Mice colonized with 
human microbiota and fed a low-fibre diet 
lost microbial species that remained in mice 
eating a high-fibre diet. When the offspring of 
the low-fibre-diet mice were given a high-fibre 
diet, the species loss was reversible, but after 
four generations, the missing bacteria were 
gone for good. 

Katherine Amato, an anthropologist at 
Northwestern University in Evanston, Illinois, 
has been trying to get to the evolutionary root 
of ahealthy human microbiome by studying 
non-human primates and tracing the effects 
of changes in human lifestyle and physiol- 
ogy. In general, Amato says, similarities in 
microbiome composition among primate 
species are closely mapped to their evolu- 
tionary relatedness. But ina 2019 comparative 
analysis, Amato found that components of the 
human microbiota (in particular, microbes 
from people living in non-industrialized 
societies) did not map as closely as expected 
to those of our nearest relatives — the great 
apes, chimpanzees and bonobos*. Instead, 
the microbiota bore a striking resemblance 
tothose of baboons — amore distant relation, 


“One person’s healthy 
microbiome might not be 
healthy in another context.” 


but one that has alifestyle more similar to that 
of early humans. “Most great apes are living in 
rainforests and eating fruit diets,” says Amato, 
“but we tend to think of our ancestors as living 
in open woodlands or savannah habitats, and 
eating an omnivorous diet — like baboons”. 
This suggests that dietary and environmental 
factors have played a prominent part in 
shaping the human microbiome. 

Ley thinks that the microbiome offers a 
powerful mechanism for adapting quickly 
to lifestyle changes — at least, relative to the 
normal glacial pace of evolution. Indeed, her 
group has found evidence of microbiome 
adaptation in response to the evolution of 
lactose tolerance’ and digestion of high- 
starch diets — genetic adaptations that have 
emerged only in certain populations over the 
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Growing up with a dog in the house increases the diversity of bacteria children are exposed to. 


past 10,000 years or so. But if changes happen 
quickly, as demonstrated by the rapid indus- 
trialization that occurred over the past few 
centuries, the historically healthy relationship 
between host and microbiome could become 
maladaptive as species that the body might 
have evolved to rely on are lost. “Antibiotics 
and sanitation have been key in controlling 
infectious diseases,” says Dominguez-Bello, 
“put have the collateral, unintended conse- 
quences of harming our good microbes.” 


Seeing the forest 


Although researchers have gained a better 
understanding of what human gut microbi- 
omes look like, they are still struggling to pin 
down which components are essential to our 
well-being. One problem is that there are far 
too few datasets to allow researchers to draw 
statistically robust connections between the 
microbiome and health or disease. Segal draws 
acomparison with the human genome — only 
when many high-quality sequences were avail- 
able did it begin to offer clinical value. “There 
are probably 30 million people that have been 
genome sequenced up until today, while inthe 
microbiome there are around 10,000 samples 
publicly available,” he says. 

This issue is compounded by the geographic 
bias in microbiome data. Beyond a handful of 
studies of selected groups such as the Hadza, 
most data are from the United States, Europe 
and China. “We know very little about micro- 
biome variation in Africa, southeast Asia and 
South America,’ says Raes. That information 
gap will be especially relevant for under- 
standing the extent of the suggested ‘missing 
microbes’ problem in the industrialized world. 

A larger, more global data set would give 
a better-informed starting point for broadly 
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understanding what a normal microbiome in 
a healthy individual can look like — and thus 
make it easier to recognize disease-linked per- 
turbations. But researchers also need to move 
beyond studies that simply assess correlation 
onthe basis of presence or absence ofa specific 
microbe ina healthy individual ora person with 
a disease at a particular point in time. 

There are now a number of multi-year, 
longitudinal studies that monitor both the 
health and the microbiome composition of 
many individuals over extended periods. 
The Canadian Healthy Infant Longitudinal 
Development study, for example, is monitor- 
ing more than 3,400 children over the course 
of 5 years in an effort to identify factors that 
contribute to conditions such as asthma 
and allergies. “If we can see that a micro- 
biome change precedes a clinical change, 
then maybe we can establish causality,” says 
Segal. Such patterns would give clinicians 
more confidence in the potential value of a 
diagnostic result or intervention, and would 
be invaluable for studying the contribution 
of the microbiome to chronic conditions that 
manifest gradually, such as diabetes. 

Researchers are also making their bacterial 
censuses more detailed. Early microbiome 
investigations were limited by the narrow 
range of intestinal species that scientists could 
grow in the lab. But the plummeting cost of 
sequencing has made it possible to capture 
detailed snapshots of the DNA extracted 
from faecal microbes. Researchers can now 
go beyond species level to identify strains of 
bacteria, and even genomic variants in those 
strains. Sonnenburg, for example, is using this 
approach to look for mutations that might 
affect the metabolic activity or dietary pref- 
erences of different gut microbes. 
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Many microbes still slip through the net, 
however. Standard methods of microbiome 
analysis favour identification of bacteria, 
and are not as good at identifying other com- 
mon gut microorganisms. “We rarely see 
signatures of fungi in our data, but we know 
they’re there,” says Lynch. “And we know that 
they’re contributing to the overall interaction 
between microbiome and host.” Alternative 
microbiome-analysis techniques offer a 
workaround. Harvesting and analysing RNA 
rather than DNA, for example, allows inves- 
tigators to capture changes in gene expres- 
sion that can reveal dysfunction in ostensibly 
normal gut species. “A perfectly nice-look- 
ing microbiome might be doing things that 
aren't healthy,” says Ley. Other researchers are 
turning to metabolomic techniques — com- 
prehensive chemical analysis of the various 
biomolecules produced in a microbiome 
sample. This is allowing researchers to eaves- 
drop on how microbes are communicating 
with each other and with their host’s cells. 
“These molecules are the end products,” says 
Lynch. “That’s where the meat is in trying to 
define biomarkers of a healthy microbiome.” 
Her lab has made important strides with such 
approaches, including homing in ona micro- 
bial lipid known as 12,13-diHOME, which seems 
to bea driver ofinflammationin infants at high 
risk of asthma’”’. 

Such data might offer the best readout 
yet of how well our internal ecosystem is 
thriving — essentially, inspecting the soil, 
water and leaves of the forest, rather than 
simply counting the trees. “There won't be 
‘the’ healthy microbiome, just like there’s no 
perfect genome,” says Segal. “There could 
be multiple healthy configurations.” These 
profiles of microbial activity might prove 
the fastest route to validating hypotheses on 
microbiome function and dysfunction, and 
accelerate the translation of discoveries into 
clinical trials. “The time of observation hasn’t 
come to an end, but I think it’s really time to 
move to interventions,” says Raes. “You can 
only understand asystem if you give it a good 
kick and see what happens.” 


Michael Eisenstein is a science journalist in 
Philadelphia, Pennsylvania. 
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Michael Fischbach: 


Homing in on the molecules from microbes 


Michael Fischbach, a bioengineer at 
Stanford University in California, describes 
the small molecules produced by microbes 
in the gut as a surprisingly unexamined 
field. To explore it, his team has developed 
a way to ‘toggle’ the molecules on and off to 
find out exactly what they do. 


How is your work helping you to investigate 
the molecules made by gut microbes? 

It gives us a way to study the effects of 
molecules produced by the gut microbiome, 
one molecule at a time. We developed our 
genetic system for a commensal strain of 
Clostridium that produces alot of molecules 
that are characteristic of the microbiome. 
They are made in very high numbers and 
they show up in the host’s bloodstream, but 
there was no good way of working out what 
they do. Our genetic system can stop the 
production of specific molecules; when we 
colonize mice witha mutant bacterial strain, 
the mice are deficient in those molecules. 
We then compare them with mice colonized 
by the wild-type bacterial strain and that do 
have the molecules, and look for differences. 


What type of molecules are you studying? 
We are focusing on branched, short-chain 
fatty acids. These are chemically similar to 
acetate, propionate and butyrate — well-stud- 
ied short-chain fatty acids from commensal 
bacteria. But the ones we are looking at are 
produced by a different metabolic route — 
they come from branched-chain amino acids 
and not much is known about them. 


Why did you focus on these fatty acids? 

We are trying to look at molecules that are 
produced in great abundance. They are 
naturally present in the body at concentra- 
tions similar to those of therapeutic drugs. 
So there is no question about whether these 
molecules end up permeating the host — they 
do. There are dozens of them, and we feel it 
is warranted to take the time to treat these 
molecules almost like a classic pharmacol- 
ogy problem: you have a molecule and you 
want to know everything about what it does. 
So we try to find out what tissues it acts in, 
what receptors it interacts with and how that 
brings about an effect on host biology. It’s 
really fundamental stuff that I think is going 
to end up in textbooks one day. 


sale 


Michael Fischbach investigates the effects of molecules produced by gut microbes. 


What have you caught the molecules doing? 
Weare finding a range of immunological differ- 
ences that can be attributed to the presence or 
absence of individual molecules produced by 
microbes. For example, the branched, short- 
chain fatty acids seem to modulate the activity 
of cells that produce the protein immunoglob- 
ulin A, which is involved in barrier protection 
in mucous membranes. We don’t have the 
complete story yet, but we are starting to work 
out the mechanism of that effect. My guess is 
that, in addition to modulating the immune 
system, the molecules do other things too. We 
just haven't had time to explore it all yet. 


Could microbial molecules lead to drugs that 
will replace whole microbe probiotics? 

Both options are onthe table. If we can identify 
the receptors that microbial molecules bind 
to, we might be able to design drugs that inter- 
act with those receptors. We could mimic the 
effects microbial molecules have on the recep- 
tors, or we might want to block their effects. 
Equally, we could also use the microbes them- 
selves. The use of whole microbes could mean 
delivering the effects of several molecules at 
atime, and could ensure more lasting effects. 
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There is little precedent for using microbes 
as drugs, and doing so will bring regulatory 
challenges. But we will probably see both 
approaches moving forward. And bothcould 
be very powerful. 


Where do you see this kind of work heading? 
We have used bacterial genetics to create 
a clean system for comparing two groups 
of mice that differ only in the presence or 
absence of a microbiome-derived molecule. 
But we did this in a rather unnatural setting: 
a mouse with only one bacterial species in 
its gut. Now, the biggest need is to create a 
model system for experiments such as these: 
bacterial communities that are completely 
defined, but complex enough to be similar 
to the native gut community. Using a com- 
munity that complex, we could carry out 
experiments in which we remove one bac- 
terial species — or even one gene from one 
bacterial species. Those are more likely to 
stand the test of time. 


Interview by Andrew Scott 


This interview has been edited for length and 
clarity. 
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Gut reaction 


Drugs and the microbiome can change each other in 
complex and little-understood ways. By Neil Savage 


f cholesterol-lowering drugs are being 
impeded by the bacteria in some people’s 
guts, Sony Tutejais hoping to work out how. 
Statins sometimes do a great job at 
reducing the amount of low-density lipo- 
protein (LDL), the ‘bad’ cholesterol that raises 
the risk of heart attacks and stroke, in the 
blood. Buta lot of people see less of a benefit, 
and some none atall. In a 2016 study, 46% of 
those treated with the drug rosuvastatin saw 
their LDL drop by 50% or more’. But 43% sawa 
less than 50% decrease, and 11% had no reduc- 
tion, or even had anincreasein LDL. 

The reason for the variation isn’t clear, but 
Tuteja, a pharmacogeneticist at the Univer- 
sity of Pennsylvania in Philadelphia, thinks the 
hundreds of species of bacteria in the intesti- 
nal tract might be involved. It could be that 
the drug throws the microbes out of balance 
in a way that alters cholesterol metabolism, 
or that certain strains of bacteria render the 
drugs less effective. Or, Tuteja suggests, “it 
could be bidirectional — the microbiome is 
affecting the drug and the drug is affecting 
the microbiome’”. 

Her hypothesis, which she is testing in a 
clinical trial, is that statins reduce circulating 
LDL by promoting the growth of gut bacteria 
that produce bile salt hydrolases — enzymes 
that break down the bile acids used to digest 
fatty foods. The liver makes bile salts out of 
cholesterol, so as bile acids are broken down, 
the organ pulls more cholesterol out of the 
blood to replace them — lowering the levels 
of LDL inthe blood. Ifsome strains of bacteria 
don’t produce as many hydrolases, that could 
explain why statins are less effective in some 
people. Or perhaps, as statins lower LDL lev- 
els, the gut is made more congenial for some 
bacteria and less so for others. In Tuteja’s trial, 
about 50 volunteers will take rosuvastatin for 
8 weeks; she will then compare the count of 
difference species of bacteria in their guts with 
that in people taking a placebo, to see whether 
the drug changes the make-up of the micro- 
biome. Tuteja and her team will also compare 
the distribution of bacteria with levels of bile 
acid in blood and faeces and the amount of 
LDL in the blood, to see whether the species 
present in the microbiome at the beginning 
of treatment can predict statin effectiveness. 
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Tuteja is one of a growing number of 
researchers looking into the gut microbi- 
ome’s role in drug metabolism, and whether it 
accounts for variations in how people respond 
to pharmaceuticals. A whole variety of drugs 
could be altering the balance of bacterial 
species, disrupting the digestive system or 
causing other problems. And gut bacteria 
producea range of enzymes and metabolites 
that might chemically alter drugs as varied 
as psychotropics and cancer treatments, 
rendering them less useful or leading to more 
side effects. 

Understanding the interplay between 
microbes and medicine could lead to new ther- 
apies, or to changes in how existing drugs are 
prescribed. For example, physicians might be 
able to predict howa person will respondtoa 
particular drug on the basis of their gut bacte- 
ria, and change a person’s prescription accord- 
ingly. Dietary changes or antibiotics might 
also be recommended to makea person’s gut 
microbiome more receptive to a drug. 


ABR ce. 
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The gut microbiome should be seen as a 
virtual organinits own right, argues Ted Dinan, 
a psychiatrist at APC Microbiome Ireland, a 
research centre at University College Cork. 
Such is its importance to drug metabolism, 
he says, “in a few years neither the US Food 
and Drug Administration nor the European 
Medicines Agency will license any drug unless 
its impact on that virtual organ has been 
studied”. (Another researcher at Microbiome 
Ireland, Niall Hyland, has received a€100,000 
(US$110,000) Global Grant for Gut Health, 
whichis supported by Nature Research — part 
of Nature’s publisher, Springer Nature — and 
probiotic company Yakult, based in Tokyo.) 


Back and forth 


Pharmaceuticals and bacteria have an undeni- 
able effect on each other. In 2018, researchers 
screened more than 1,000 drugs, marketed 
for various conditions, against 40 strains of 
human gut bacteria. They found that nearly 
one-quarter of those drugs had antibiotic 
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effects, even though none of them were being 
soldas antibiotics”. Andin 2019, another team 
found that of 271 drugs incubated with gut 
microbes, 176 were metabolized to such an 
extent that the level of the drug dropped by 
more than 20%?. 

Dinan and his colleagues are examining 
what part the microbiome plays in mental 
health, including whether it interacts with 
psychotropic drugs. Evidence suggests that 
low diversity of gut microbes is associated 
with mental-health conditions such as schiz- 
ophrenia. Researchers at Microbiome Ireland 
showed that it was possible to essentially 
transplant a mood disorder into rats by wip- 
ing out their native microbes with antibiotics 
and then giving the rats a dose of gut bacte- 
ria from people whom Dinan was treating for 
depression‘. “When they got atransplant from 
depressed patients, their behaviour was sig- 
nificantly altered,” Dinan says. That’s a strong 
sign that the microbiome can affect mental 
health, he says. 

This finding might have implications for the 
practice of faecal microbiota transplantation, 
an emerging treatment for gastrointestinal 
illnesses such as irritable bowel syndrome. 
Currently, donated faecal matter is tested 
for infections that could be passed on, such 
as hepatitis C. “Because of our study,’ Dinan 
says, “I’m convinced they should be looking 
at the psychiatric profile of the donor as well.” 

Researchers have also found that some 
bacteria can synthesize neurotransmitters, 
such as dopamine or acetylcholine, as well 
as precursor chemicals such as tryptophan, 
which is used to make the mood-regulating 
chemical serotonin. “We now know that cer- 
tain good bacteria — bifidobacteria — are 
capable of synthesizing tryptophan,” Dinan 
says. But the molecule is also found in foods 
suchas turkey, and it is not known how much 
of the tryptophan that makes it to the brain 
comes from diet and how much is produced 
by bacteria. 


Hearts and minds 


As they learn more, physicians might want to 
take into account a person’s particular mix 
of microbes when prescribing psychotropic 
drugs. Two species of gut bacteria, Enterococ- 
cus faecalis and Eggerthela lenta, metabolize 
the drug L-DOPA, which is used to treat Par- 
kinson’s disease®. Scientists have long known 
about an enzyme in the body that breaks down 
the drug and decreases the amount that makes 
it to the brain. Usually, physicians prescribe 
asecond drug alongside L-DOPA to partially 
counteract the breakdown, but breakdown of 
the drug by bacteria is not currently factored 
in. Researchers have, however, identified a 


molecule that inhibits F. faecalis’s activity. 
“There is some potential for translating this 
data ifa company or someone was interested,” 
says Peter Turnbaugh, a microbiologist at the 
University of California, San Francisco, who 
collaborated on the discovery. 

The idea that bacterial metabolism affects 
how well drugs work is not new, Turnbaugh 
says. Back in 2013, he and his colleagues found 
a pair of genes in E. lenta that give it the abil- 
ity to digest the heart-disease drug digoxin®. 
When they fed mice the amino acid arginine, 
however, digoxin levels stayed high. The 
researchers are not sure why that’s so, but it 
means that giving arginine along with digoxin 
could protect the drug. And there are signs 
that the bacterium might be responsible for 
the variation in how people respond to the 
rheumatoid-arthritis drug methotrexate. 
“We've known for almost a century now that 
the microbiome matters for drugs, but people 
have sort of ignored it,” Turnbaugh says. “Most 
development of drugs, as well as their use in 
the clinic, is microbiome-naive.” 


“I’mconvinced they 
should be looking at 
the psychiatric profile 
of the donor as well.” 


A detailed understanding of which micro- 
biota interact with which drugs, and the 
mechanisms behind those interactions, could 
suggest ways to either inhibit or enhance the 
interaction between drugs and the micro- 
biome. Some mechanisms are known. For 
instance, the colon-cancer drug campto- 
thecin-11 is metabolized by the liver into an 
inactive molecule; enzymes produced by gut 
bacteria, however, can reactivate it into a toxic 
form, causing severe diarrhoea. Researchers 
at the University of North Carolina at Chapel 
Hill have come up with a compound that 
could target the enzymes without disrupting 
the microbiome — a potential treatment for 
the diarrhoea. And their spin-off company, 
Symberix, is developing treatments to reduce 
side effects caused by gut bacteria. 


Complicated undertaking 


But untangling the complex interaction 
between drugs and the microbiome won't be 
an easy task. For one thing, the various species 
of bacteria in the human gut have 150 times 
more genes than the human genome. And the 
selection of microbes present in the gut can 
vary a great deal from person to person. “My 
microbiome is really different from yours,” 
says Anukriti Sharma, a microbiologist at 
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the University of California, San Diego. “That 
means we might also have very different genes 
that are involved in metabolism.’ In fact, one of 
the limitations of microbiome studies is that 
they have mainly taken place in the United 
States, Europe and China, but microbiomes are 
known to vary widely from region to region. 
That can have consequences for medicine, 
says Turnbaugh. “If you test a drug in America, 
it could behave completely differently in Africa 
or in South America,” he says. 

Another issue is that there doesn’t seem to 
be one common mechanism for how bacteria 
and drugs affect each other. “Each drug seems 
to have its own unique way of interacting with 
the microbiome,’ says Filipe Cabreiro, a bio- 
chemist at Imperial College London. That, he 
says, makes it difficult to draw any general 
conclusions. 

Still, Cabreiro says, there are broad simi- 
larities in how certain classes of drugs work 
with the microbiome. Antipsychotics often 
change the balance of gut bacteria. Some can- 
cer drugs are degraded or modified by chem- 
istry in the gut that either enhances or reduces 
their effects (see page S16). Metformin, acom- 
mon diabetes drug that Cabreiro is studying 
for its anti-ageing potential, seems to trigger 
certain signalling pathways in bacteria that 
changes the production of metabolites, which 
then have their own effects on the body. “We 
have to take it a drug at a time, a microbe at 
atime, anda disease at atime,” Sharma says. 

If that complexity can be worked out, the 
next step will be to look at altering the micro- 
biome to enhance drugs’ effectiveness or 
decrease their side effects. As with the heart 
medication digoxin, that could mean supple- 
menting a drug with another compound that 
influences its mechanism of interaction. It 
could also mean trying to change the make-up 
of the bacterial community, whether through 
strategic use of antibiotics, dietary changes to 
promote or discourage particular microbes, 
or even faecal transplants to replace ‘bad’ gut 
bacteria with ‘good’ ones. 

And it could make precision medicine more 
precise, with physicians sequencing not only 
the genes of patients but also of their microbes 
to predict response to a treatment. “For the 
future of personalized medicine,’ Cabreiro 
says, “you have to take into account not just 
the host, but the microbiome too.” 


Neil Savage is a science journalist in Lowell, 
Massachusetts. 
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Neuroscientist Egle Cekanaviciute found that people with multiple sclerosis have different gut microbiomes from those without the disease. 


When immunity 


goes wrong 


If the gut microbiome can trigger autoimmune 
diseases, can it also help to cure them? By Eric Bender 


or Gregg Silverman, arheumatologist at 

New York University, the day awomen 

he was treating for lupus was visited 

by her identical twin sister was a water- 

shed moment. The sister was a picture 

of health, witha one-year-old child inher arms. 
Silverman’s patient, meanwhile, was receiving 
kidney dialysis and, despite his best efforts, 
her condition was getting worse. “Genetics 
was not going to explain this difference,” says 
Silverman. The revelation launched him ona 
decades-long quest to seek out other factors 
that drive the puzzling autoimmune disease. 
Researchers investigating other autoim- 
mune diseases have also been looking beyond 
genetics. Inthe case of type 1 diabetes, the epi- 
demiological evidence for doing so is over- 
whelming, says Jayne Danska, a geneticist 
at the Hospital for Sick Children in Toronto, 
Canada. Genetics “doesn’t explain why the 
incidence of the disease has been rising over 
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the last 50 years in many different countries 
— and it doesn’t explain why the age of onset 
is becoming progressively earlier”, she says. 
Many events, including viral infections and 
certain foods, have long been suspected of 
helping to trigger autoimmune diseases, in 
which the body attacks its own cells. But over 
the past decade, new suspects have emerged 
—thetrillions of microbes inhabiting the diges- 
tive tract. Scientists have now implicated the 
gut microbiome in numerous autoimmune 
conditions, including lupus, type 1 diabetes, 
rheumatoid arthritis and multiple sclerosis. 
For example, in 2017, researchers at the Uni- 
versity of California, San Francisco, compared 
the gut microbiomes of people with multiple 
sclerosis with those of healthy volunteers. This 
study, led by geneticist Sergio Baranzini and 
neuroscientist Egle Cekanaviciute, found that 
many bacterial species were present in very 
different quantities in the two groups’. This 
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research “not only identified differences in 
microbial communities, but actually showed 
that they had physiological significance ina 
human immune-cell experimental system”, 
says Cekanaviciute, who now studies micro- 
biome health effects at NASA’s Ames Research 
Center in Moffett Field, California. 

When two species that were more abundant 
in people with multiple sclerosis were incu- 
bated in human blood cells in vitro, the cells’ 
inflammatory responses climbed. Another 
bacterial species, whose levels were depressed 
in people with multiple sclerosis, stimulated 
anti-inflammatory cells. And when the inves- 
tigators transferred a microbiome froma per- 
sonwith multiple sclerosis into germ-free mice 
(those reared to be devoid of microorganisms), 
“these mice gota lot sicker than mice receiving 
ahealthy human microbiome’”, says Baranzini. 

Scientists are trying to understand the 
mechanisms behind the apparent ability of the 
gut microbiota to trigger or to sustain auto- 
immune conditions. They hope to turn that 
knowledge into better therapies for conditions 
that are currently difficult to treat — perhaps 
even in the form of simple probiotic pills. 


Molecular mimicry 

Autoimmune diseases are often traced, in part, 
to alterations inthe human leukocyte antigen 
(HLA) gene complex, a cornerstone of the 
adaptive immune system, which recognizes 
and remembers specific pathogens. HLA genes 
express proteins that present antigens to our 
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immune system’s T and B cells. The immune 
cells then spot and attack dangerous intrud- 
ers carrying those antigen flags. T cells and 
B cells are selected out to ignore the body’s 
own cells but in autoimmune diseases this 
doesn’t happen. 

Although most T cells are trained in the thy- 
mus toignore ‘self’ proteins, some are trained 
inthe gut. “Given all the different environmen- 
tal factors that come in contact with the gut, 
you need a lot of immune tolerance there,” 
explains Marika Falcone, an immunologist at 
IRCCS Ospedale San Raffaele in Milan, Italy. 

Experiments in colonizing germ-free mice 
with specific microbes in the gut have shown 
that the effects are broadcast throughout 
the immune system, Danska says. In turn, the 
immunesystem in the gut affects the microbes 
there. Biologists are exploring various routes 
by which the gut microbiome might help to 
stimulate or stop immune pro-inflammatory 
responses — driven either by the bacteriathem- 
selves, or by the metabolites they produce, the 
immune cells they train, or another mechanism. 

One line of enquiry is whether the enormous 
genetic variation between microbes leads to 
immune cells becoming confused as to what 
is foreign and what is self. A meta-analysis that 
examined 3,665 human samples identified 
more than 22 million gut microbiome genes’. 
The proteins produced by these genes are scru- 
tinized by the immune system, and, overwhelm- 
ingly, found to be harmless or easily handled. 

But sometimes microbial proteins that 
alarm immune cells contain fragments that 
closely resemble those of normal human pro- 
teins. With roughly a hundred times as many 
genes in our individual microbiomes as in 
our own genomes, there’s a high likelihood of 


“We want to understand 
the rules that regulate the 
bacterial society in the gut.” 


similarities, says Martin Kriegel, animmunob- 
iologist at Yale University in New Haven, Con- 
necticut. The result, so the theory goes, is that 
this starts to teach the immune system to rec- 
ognize human proteins as signs of a threat. In 
such cases of molecular mimicry, “the immune 
system gets confused”, says Baranzini. “It starts 
reacting against the bacteria. And then it ends 
up reacting against our own self proteins.” 
Kriegel and his colleagues demonstrated 
a molecular mimicry response in cells from 
people with lupus using a bacterial protein 
very similar to the human protein Ro60 (ref. 3). 
Molecular mimicry could also be at work in 
rheumatoid arthritis — peptides produced 
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Micrograph of a spinal cord affected by 
multiple sclerosis. 


by gut bacteria such as Prevotella closely 
resemble human peptides presented to the 
immune system in the joints of people with the 
condition. Ina 2017 study*, immune reactions 
to the microbial peptides corresponded with 
those of the matching host peptides, “which 
was a pretty strong signal”, says Allen Steere, 
a rheumatologist at Massachusetts General 
Hospital in Boston and an author of the study. 


Gut to go 


If gut microbiota do confuse the immune 
system, the question remains of how such 
autoimmune effects spread from the gut. In 
some cases, specific cells are affected, such 
as nerve cells in multiple sclerosis, and pan- 
creatic B-cells in type 1 diabetes. In lupus and 
rheumatoid arthritis, autoimmunity occurs 
across multiple organs. 

Steere and his colleagues found evidence 
of Prevotella DNA in the joints of some peo- 
ple with rheumatoid arthritis*. That finding, 
Steere says, suggests that either the bacteria 
themselves, or bacterial remnants carried by 
immune cells, can get into joints. 

In the case of multiple sclerosis, “I don’t 
think the bacteria move, but their metabolites 
do,” says Patrizia Casaccia, a neuroscientist 
at the City University of New York. She notes 
that the metabolites might signal throughthe 
vagus nerve, which transmits messages from 
the gut to the brain. In some cases, bacteria 
themselves have been found in affected organs 
— suchas in the pancreas in type 1 diabetes. 

Kriegel and his team showed that in mice 
predisposed to a lupus-like condition, Ente- 
rococcus gallinarum bacteria move out of the 
gut to other organs, including the liver, where 
they set off an immune cascade that leads to 
lupus®. The investigators also identified similar 
biological pathways in human liver cells. Most 
importantly, Kriegel says, the bacteria were 
found in most liver biopsies from people with 
lupus — but not in those from healthy people. 
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His team also showed that either antibiotic 
treatment or a vaccine against F. gallinarum 
prevented autoimmunity developing in mice. 
“One could already envision a potential future 
therapy targeted against these bugs that cross 
the gut barrier,” Kriegel says. 


Microbes in the clinic 


Reports about faecal matter transplants 
(FMTs) or probiotic pills have given people 
with immune conditions hope that there 
could be an easy way to prevent or treat their 
disease. Scientists share this desire, but warn 
that clinical research has barely begun. 

For multiple sclerosis, for example, treat- 
ment might eventually “be as simple as a tar- 
geted dietary intervention that will shift the 
community from pro-inflammatory bacteria 
to more anti-inflammatory types”, says Baran- 
zini. In one possible step towards this goal, his 
team is running a small phase I clinical trial of 
FMT to assess safety and side effects. 

Casaccia emphasizes the importance of 
proceeding with caution. “We want to under- 
stand the rules that regulate the bacterial 
society in the gut,” she says. “Maybe we can 
develop a combination of healthy probiot- 
ics and prebiotics to support the growth of 
the beneficial bacteria, and perhaps dietary 
manipulation might contribute to that,’ she 
says. “But I’m not sure we are there yet.” 

Researchers do see major progress. “The 
enormous effort that’s been spent over the last 
20 years using different kinds of tools in the 
box is really beginning to bear fruit,’ says Dan- 
ska. Her team has built a platform to identify 
antibacterial antibodies in blood. Analysing 
samples from children at high risk of type 1 
diabetes, the platform revealed important 
clues about who would develop the disease®. 

Danska hopes that better knowledge about 
the gut microbiome, especially during the first 
3 years of life, will lead to disease-preventing 
interventions. Those might include giving 
babies well-defined compositions of microbes, 
so that achild’s immune system “develops with 
optimal tolerance to self without sacrificing 
their ability to fight infection”, she says. “That’s 
the kind of therapy that could have global 
impact because bugs are cheap. If we can come 
up with defined compositions of microbesina 
gummy bear — now we're talking!” 


Eric Bender is a science writer in Newton, 
Massachusetts. 
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Autism and the gut 


The gut microbiome might have an integral role in autism spectrum 
disorder. But the link remains uncertain. By Elizabeth Svoboda 


Achild with autism spectrum disorder communicates using symbols during lunch. 


efore Ethan Loyola reached his first 
birthday, it was clear that something 
was wrong with his gut. As a baby, he 
had been given several courses of anti- 
biotics to treat severe ear infections, 
after which he experienced foul-smelling, 
acidic diarrhoea that left him convulsed with 
pain. Around the age of one, Ethan lost the 
words he’d started to say and stopped mak- 
ing eye contact. Soon after that, he received 
a diagnosis: autism spectrum disorder (ASD). 

As Ethan grew older, his digestive issues con- 
tinued, and he struggled around other people 
and in unusual situations. “He had his hands 
over his ears, didn’t want to be in crowds,” 
says his mother, Dana Woods. “It was just too 
overwhelming.” Then Ethan’s dad saw a flyer 
tacked up at an autism therapy clinic. Scien- 
tists at Arizona State University in Tempe, near 
the family’s Phoenix home, were looking for 
children with autism to try an experimental 
treatment called microbiota transfer therapy, 
which would be used to recolonize the chil- 
dren’s guts with bacteria from donors who 
were not on the autistic spectrum. Ethan’s 
parents enrolled him in the study. 

ASD can cause various language and social 
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difficulties. And, for the 1 in 160 children it 
affects worldwide, treatments have been frus- 
tratingly slow in coming. Some parents have 
long claimed that changing a child’s diet or 
giving them probiotics improves notjust diges- 
tive problems, but also behavioural symptoms. 
Now, researchers are revealing that in addition 
to aiding digestion, gut bacteria also manufac- 
ture bioactive compounds that help to orches- 
trate brain function and social development. 
Studies show that children with ASD often have 
amix of gut microbes thatis distinct from that 
inchildren without the condition. And in lab ani- 
mals, autism-like symptoms arise when normal 
species of gut bacteria are absent. 

Although far from conclusive, these find- 
ings are driving researchers to probe the links 
between gut microbes and autism symptoms 
— and to begin testing ASD treatments that 
repopulate the gut microbiome from scratch. 


Laying gut groundwork 

John Cryan, a biochemist at University College 
Corkin Ireland, was among the first research- 
ers to investigate how gut microbes affect 
social behaviour. In 2014, he reported that 
germ-free mice — those lacking the typical 
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mix of gut microbes — avoided other mice, 
shunned new social situations and groomed 
themselves excessively’. “It started to crystal- 
lize that the microbiome was involved in many 
aspects of behaviour,” Cryan says. “There 
seems to be something about the social brain 
in particular that makes it sensitive to signals 
fromthe microbiome.” Heand other scientists 
proposed the existence of a gut-brain axis, 
in which gut microbes produce bioactive 
compounds that influence brain function. 

Other studies have bolstered this theory, 
showing that when gut bacteria help to digest 
food, they generate a host of by-products that 
can affect thinking and behaviour. Clostridia 
bacterial pathogens, for instance, generate 
propionic acid in the gut — a short-chain 
fatty acid known to disrupt the production of 
neurotransmitters. Propionic acid also causes 
autism-like symptoms in rats, such as repeti- 
tive interests, unusual motor movements and 
atypical social interactions’. 

Deficits in beneficial gut bacteria might also 
affect social brain function. In 2017, Cryan 
reported that when mice with an autism-like 
condition had lower levels of Bifidobacterium 
and Blautia gut bacteria, their guts made 
less tryptophan and bile acid — compounds 
needed to produce serotonin’. And children 
with autism have been consistently found to 
have lower levels of Veillonellaceae, Coproc- 
occus and Prevotella gut bacteria than those 
without the condition‘. Researchers have also 
observed that some people with ASD could 
have an abnormally porous blood-brain 
barrier, which allows some toxic bacterial 
by-products to enter the bloodstream and 
reach the brain’. 


Behavioural change 


Such findings prompted researchers at the 
California Institute of Technology in Pasadena 
to transfer the microbes of people with ASD 
into mice. Microbiologist Sarkis Mazmanian 
and his colleagues gave germ-free mice gut 
microbes from people with ASD; at six weeks 
old, offspring of these mice socialized less, 
produced fewer vocal sounds and engaged 
in more repetitive behaviour, compared with 
mice descended from animals that received gut 
microbes from donors who did not have ASD*. 
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The differences between the two groups also 
proved striking ona biological level. “We uncov- 
ered several differences in faecal and serum 
metabolites,” says Gil Sharon, a co-author 
of the research. In particular, mice with gut 
microbes from donors with ASD had lower 
levels of compounds produced by gut bacte- 
ria that are thought to affect brain function. 
In particular, the amino acids 5-aminovaleric 
acid (SAV) and taurine were depleted — both 
metabolites increase the activity of the brain’s 
y-aminobutyric acid (GABA) receptors. GABA 
is aneurotransmitter involved in sensory pro- 
cessing and motor control, and abnormalities 
inthe GABA system have been notedin children 
with ASD. 

When the team gave the two missing metab- 
olites to mice with autism-like symptoms, 
Sharon says, “they improved core deficits in 
social interaction and repetitive behaviour”. 
The study has received some criticism regard- 
ing its data analysis and small sample size. The 
finding that mice with microbes from donors 
with ASD vocalized less, one critic argued (see 
go.nature.com/2qbqwy1), involved gut bacte- 
ria from only five donors with autism and three 
without, andthe researchers ran many analyses 
onthe data to finda difference between the two 
groups of mice — the more analyses run ona 
data set, the more likely it is that a false effect 
could be found. The team declined to comment 
on these criticisms, but stands by its results. 

The growing focus onthe role of gut microbes 
in social brain function prompted the team at 
Arizona State University to up the ante by test- 
ing microbes in children with ASD. Microbial 
ecologist Rosa Krajmalnik-Brown and her col- 
leagues recruited a group of 18 children with 
ASD aged between 7 and 16, including Ethan 
Loyola. Like Ethan, all of the children had a his- 
tory of gastrointestinal problems, including 
chronic diarrhoea, pain and constipation. 

The children inthe study took the antibiotic 
vancomycin for two weeks to remove existing 
bacteria. Then, each of them received a high 
dose of gut microbes from donors without 
autism — some children received this as an 
enema, whereas others took a bacteria-for- 
tified drink. The children continued to take 
a daily dose of microbes for seven or eight 
weeks (Ethan took his in a shot glass, mixed 
with cranberry juice), along with an antacid 
to neutralize stomach acid and improve the 
chance of the new microbes surviving. 

At the end of the 18-week study period, the 
children’s gastrointestinal symptoms had 
reduced by 80%, and most of the improvement 
remained two years after the original study. By 
the two-year mark, the children’s scores ona 
test to measure how much they were affected 
by their autism were an average of 47% lower 


than they had been at the beginning of the 
trial. Also at the two-year follow-up, the chil- 
dren had increased gut-bacterial diversity and 
greater numbers of gut bacteria that are often 
found inlower numbers in children with ASD, 
such as Bifidobacteria and Prevotella. This 
suggests that the treatment had succeeded 
in changing their microbiomes long term, 
Krajmalnik-Brown says. 


A muddled outlook 


Although these early results look promising, 
the authors stress that little will be known for 
sure until they complete a larger trial that 
includes a control group. “The fact that we 
have exciting results doesn’t mean that this will 
work for everyone,” Krajmalnik-Brown says. 
Her team is now carrying out a phase II trial 
of microbiota transfer therapy with 84 adults 
with ASD, some of whomwill receive a placebo. 
The team is also looking for funding to start a 
similar trial in children. And even ifthese trials 
are successful, even bigger phase III trials will 
be needed a few years down the line before 
the treatment can be used in the clinic. Along 
the way, Krajmalnik-Brown’s team is hoping to 
work out the biological mechanisms behind 
the improvement in symptoms. 

There is currently no approved medication 
specifically for the core symptoms of autism — 
communication difficulties, social challenges 
and repetitive behaviour. Krajmalnik-Brown 
hopes that her research will lead to one, but 
this is by no means assured. Although studies 
report that people with ASD havea gut-bacte- 
rial profile distinct from that of people without 
the condition, no study has yet shown that gut 
microbes cause autism in people. One lead- 
ing hypothesis is that some microbes could 
cause autistic symptoms to be worse in people 
who are prone to the condition, but this too 
remains unproven. People with ASD might 
harbour different gut microbes from those 
without the condition because they tend to 
have very specific dietary preferences. 

Likewise, evidence that gut microbial ther- 
apies can help people with ASD is mixed. 
Krajmalnik-Brown strikes an optimistic note, 
but a review of four other studies did not reach 
aclear conclusion about the benefits of gut bac- 
terial supplementation in children with ASD 
— although these earlier studies used ordinary 
probiotics and not bacteria directly from the 
human gut’. “The human datais very heteroge- 
neous,’ Cryansays. “If we target the microbes, it 
willbe interesting to see if we have behavioural 
effects, but we need more evidence overall.” 

As KrajmaInik-Brown’s trials progress, she 
warns children and families to steer clear of 
do-it-yourself faecal transplants. Instead, she 
says they should discuss options for altering 
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The surface of the small intestine has 
projections called villi. 


the balance of gut microbes, such as conven- 
tional probiotics, with their physician. She also 
wants to temper people’s expectations for the 
treatment. “This might help alleviate the symp- 
toms.| don’t believe that we have acure on our 
hands,” she says. 

For Ethan, however, the Arizona State 
study resulted ina striking turnaround. After 
the microbiome transfer, his diarrhoea and 
cramping disappeared in months. But most 
dramatic was his new interest in people. He 
began waking up witha smile, crowing, “Good 
morning!” He also started asking questions, 
such as ‘why is the sky blue and the clouds 
white?’ something he had not done before, 
his family says. In social-skills tests, Ethan, who 
isnow12,scoresinthe same rangeas children 
who are not on the autistic spectrum. 

Did Ethan make these strides because his 
crippling stomach pain went away? Or have 
the new microbes truly changed the way 
his brain works? No one knows for sure. But 
Ethan’s family welcomes the research. “The 
study was the tipping point,” his mother says. 
“It was completely life altering.” 

Looking forward, Cryan says, it’s crucial for 
scientists to keep decoding the biological path- 
ways that connect the gut to the brain. “We're 
optimistic that a healthy gut will meana healthy 
brain, but we need the data.” Those data should 
help to determine just what fuelled Ethan’s 
improvements — and whether thousands of 
other children could follow a similar course. 


Elizabeth Svoboda is a science writer in San 
Jose, California. 
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Fighting cancer 
with microbes 


Targeting the microbiome could hold the 
key to combating a range of malignant 


diseases. By Elie Dolgin 


nJuly 1984, a young Australian gastroen- 

terologist drank a beef broth spiked with 

the pathogenic bacterium Helicobacter 

pylori. Within a week, he started vomiting. 

His breath began to stink. And he couldn’t 
have been happier. 

Barry Marshall wanted to prove that H. pylori 
could trigger inflammation of the stomach 
lining, a first sign of stomach cancer. By taking 
a biopsy of his own stomach tissue, Marshall 
demonstrated unequivocally that the hardy, 
spiral-shaped microorganism could cause 
gastric disease. Twenty-one years later, Mar- 
shall and his mentor Robin Warren won the 
Nobel Prize in Physiology or Medicine for their 
discovery linking the bacterium to chronic 
inflammation, peptic ulcers and stomach 
ailments such as cancer. 

Yet H. pyloriwas long considered to belong 
toaspecial club of infectious agents, together 
with viruses such as human papillomavirus, 
that could provoke tumour formation. In 
oncology circles, the trillions of microbes that 
inhabit our guts, skin and other tissues were 
seen mostly as benign bystanders. 

Cancer researchers now realize that many 
of those seemingly harmless microbes are 
anything but. Over the past decade, it’s 
become clear that gut microbes can pro- 
duce DNA-damaging toxins and carcino- 
genic metabolites, induce cancer-promoting 
inflammation, make tumours more resistant 
to chemotherapy drugs, and suppress the 
body’s anticancer immune responses. “Every 
day nowthere seems to be some new microbe 
associated with cancer,” says Susan Bullman, a 
microbiologist at the Fred Hutchinson Cancer 
Research Center in Seattle, Washington. 

While researchers such as Bullman are now 
racing to unravel the molecular mechanisms 
behind tumour-promoting bacteria — and in 
so doing, identify targets for risk assessment, 
early detection, prevention and treatment — 
many cancer researchers are already testing 
whether microbiome-based therapeutics 
can improve the efficacy or safety of exist- 
ing anti-cancer interventions. Borrowing a 
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page from the gastroenterology playbook, 
oncologists around the world have begun 
giving their patients faecal transplants and 
bacteria-filled capsules. 

These living medicines have already revolu- 
tionized the treatment of antibiotic-resistant 
gut infections. A few studies have also shown 
the potential of faecal transplants for people 
with blood cancers receiving a stem-cell trans- 
plant. (These patients must take broad-spec- 
trum antibiotics to prevent infections, but in 
so doing they lose the bacteria that are needed 
to prevent donated cells attacking the host.) 
Now, researchers are beginning to find that a 
dose of beneficial microbes enhances the effi- 
cacy of immune-modulating drugs known as 
checkpoint inhibitors and mitigates toxicity. 

“Modulating the microbiome makes com- 
plete sense,” says Jennifer Wargo, a surgical 
oncologist at the University of Texas MD 
Anderson Cancer Center in Houston. “Peo- 
ple are really embracing the idea and we’re 
beginning to see the early fruits of that labour.” 


Not the only bad guy 


Microbiologist Jun Yu of the Chinese Univer- 
sity of Hong Kong has begun to take a close 
look at the role that bacteria have in driving 
stomach cancer. Yu's team identified a handful 
of microbes that were consistently enriched 
in samples from people with gastric cancer’ 
or precancerous stomach lesions. “H. pylori 
is not the only bad guy,’ she says. 

Yu suspects that the focus on H. pylori was 
anaccident of history. This microbe happened 
to growin laboratory cultures — the standard 
technique for bacterial identification during 
the 1980s. Yu’s team relied on DNA analyses 
instead. “Gene sequencing provides a good 
opportunity to identify other microbesinthe 
stomach that also play some role but weren't 
discovered before,” she says. Her team is now 
evaluating the tumour-causing potential of 
these bacteria in mouse models. 

Gut microbes have also been linked to bowel 
cancer, the third most common type of cancer 
worldwide. A toxin produced by astrain of gut 
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Scanning electron micrograph of Helicobacter 
pylori on the surface of the intestine. 


bacterium called Bacteroides fragilis, a cause 
of diarrhoeal disease in young children, was 
implicated inthe development of bowel cancer 
in 2009. Cynthia Sears, an infectious-disease 
specialist at Johns Hopkins University in Bal- 
timore, Maryland, who made the original dis- 
covery in mice, has since shown that the toxin 
recruits immune cells to the intestinal lining 
and promotes an inflammatory cascade that 
can lead to cancer’. 

And ateam led by Christian Jobin, animmu- 
no-microbiologist at the University of Flor- 
ida in Gainesville, made a similar discovery: 
a strain of Escherichia coli that induces bowel 
cancer through the production ofa toxin that 
damages DNA’. None of these pathogens nec- 
essarily work in isolation, however. “It’s an 
assembly of microorganisms that can collec- 
tively impact genome stability and immune 
function,” Jobin says. 

Evidence for the idea that both B. fragilis and 
E. coliwork together to fuel tumour growth 
comes from Sears, who showed that people 
whoare genetically predisposed to bowel can- 
cer frequently have patchy bacterial biofilms 
in their colons composed predominantly of 
these two microbes’. 


Menace of microbes 


Although it did not crop up in Sears’s study, 
there are multiple lines of evidence that the 
bacterium Fusobacterium nucleatum — a 
spindle-shaped microbe found in most peo- 
ple’s mouths — is also a major driver of bowel 
cancer. It seems to promote proliferation of 
cancer cells through direct interactions with 
intestinal cells, rather than through a toxin 
intermediary. 

The association between F. nucleatum 
and cancer first emerged through sequenc- 
ing-based studies of tumour samples. 
Immunogeneticist Robert Holt at the British 
Columbia Cancer Agency in Vancouver, Can- 
ada, compared RNA from 11 bowel tumours 
with sequences from adjacent normal tissues’. 
When he and his colleagues looked for micro- 
bial genes expressed at elevated levels in the 
tumours, “the Fusobacterium signal really 
_ jumped out”, Holt says. Around the same time, 
a team led by immunologist Wendy Garrett 
and cancer geneticist Matthew Meyerson at 
Dana-Farber Cancer Institute in Boston, Mas- 
sachusetts — stumbled on the same microbe 
after analysing the DNA of bowel tumours°. 

In the years since, research teams from the 
Czech Republic, China, Japan and South Korea 
have all found that people with higher levels 
of F. nucleatum in their bowel tumours tend 
to have worse survival outcomes. The biolog- 
ical explanation, however, remains elusive. 
“If you have Fusobacterium and advanced 


colorectal cancer, that sucks because your 
time to progression is going to be shorter,” 
Garrett says. “But why is that?” 

Research in mice by scientists including 
Garret suggests that the presence of F. nucle- 
atum increases cancer growth, possibly 
through the modulation ofimmune responses 
or through the direct activation of cancer sig- 
nalling pathways. The microbe also seems to 
promote resistance to chemotherapy through 
the induction of a cellular recycling process 
that enhances tumour survival in the face of 
drugs. Other tumour-resident bacteria can 
metabolize chemotherapeutics, which further 
contributes to drug resistance. 

Last year, an international team led by 
Garrett and Meyerson secured a £20-million 
(US$26-million) grant from Cancer Research 
UK to discover exactly how microbes in the 
body lead to bowel cancer. Part of the funding 
is earmarked for finding new ways to eliminate 
the prime suspects — F. nucleatum, B. fragilis 
and £. coli — without disturbing the entire 
microbial ecosystem inside the gut. 

In 2017, Bullman, who was working in 
Meyerson’s lab at the time, showed that the 
antibiotic metronidazole could slow cancer 
growth in mice with Fusobacterium-positive 
tumours’. But the antibiotic also targets a 
range of beneficial microbes in the intestines. 
“What we really need,” says Bullman, “are more 
targeted approaches.’ A drug specific to Fuso- 
bacterium could do the trick. Ora vaccine. Or 
perhaps a phage therapy that takes advantage 
of bacteria-infecting viruses to precisely elim- 
inate the nefarious microbe. 


Targeted attack 


At the 2019 International Cancer Immuno- 
therapy Conference in Paris, scientists from 
microbiome therapeutic company BiomxX in 
Ness Ziona, Israel, reported the discovery of 
several Fusobacterium-targeted phages that, 
when injected into the bloodstream of mice, 
could successfully invade bacteria nestled 
inside implanted tumours. According to chief 
executive Jonathan Solomon, Biomx is har- 
nessing synthetic biology to turn the phages 
into programmable bacterial assassins that 
could also deliver a therapeutic payload that 
helps to recruit tumour-fighting immune cells 
to the site of attack. 

Alternatively, predatory bacteria called 
Bdellovibrio could be deployed in the fight 
against cancer-causing microbes — and 
microbiologist Emma Allen-Vercoe from the 
University of Guelph, Canada, didn’t need to 
look far to discover one that could destroy 
Fusobacterium. Her team dug up a patch of 
clover from the lawn in front of the university’s 
clock tower. Among the germ-eating microbes 
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they found, “someare real championsat killing 
fusobacteria”, she says. 

Despite these efforts, most specialists 
acknowledge that it remains to be shown defin- 
itively whether the human microbiome has 
a causal role in cancer. It could be that some 
bacteria are merely opportunistic invaders of 
tumours once cancer has already taken hold. 

“We're lacking solid direct evidence in the 
form of longitudinal cohort studies that the 
microbiome causes cancer,” says Alasdair 
Scott, acolorectal surgeon at Imperial College 
London. Heis one of the architects of the Inter- 
national Cancer Microbiome Consortium, a 
global body that aims to establish expert con- 
sensus on the role of the microbiome in oncol- 
ogy. Last year, the group wrotea policy paper 
calling on researchers to address the question 
of causation versus association®. 


“Every day nowthere seems 
to be some new microbe 
associated with cancer.” 


Prospective cohort studies take years, how- 
ever, and newtreatment options for cancer are 
needed now. “You can’t wait until everything 
is completely known,” Holt says. Clinicians are 
therefore moving ahead with testing microbi- 
ome modulation, especially in patients receiv- 
ing checkpoint-inhibitor drugs designed to rev 
up the body’s antitumour immune response. 

At the 2019 meeting of the American Asso- 
ciation for Cancer Research, two research 
groups described promising work in people 
receiving immunotherapy for melanoma. In 
both cases, the tumours of people who initially 
did not benefit from the treatment shrank 
after receiving faecal matter from someone 
who did respond to the drugs. 

Elsewhere, researchers have been adminis- 
tering poo-stuffed pills from healthy donors 
alongside immunotherapy, with similarly 
favourable results. And, given that the pro- 
cedure is generally considered to be low 
risk, some clinicians are beginning to think 
about a future in which stool swaps become 
a standard add-on to immunotherapy for all 
patients, not just those who fare poorly onthe 
immune-targeted drugs to begin with. “Every- 
body might be doing it right off the bat,” says 
John Lenehan, an oncologist at the London 
Regional Cancer Program in Canada. 

The safety of faecal transplants was called 
into question, however, when researchers 
last year described how contaminated stool 
left one man dead and another severely 
ill in experimental trials investigating 
the procedure for other applications’. 
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People with cancer might benefit from receiving microbial therapies alongside cancer drugs. 


Physicians now test donated samples for the 
drug-resistant strain of £. colithat caused the 
infections. But fearing that a newly virulent 
microbe could slip through the screening 
process, some researchers are turning to 
defined, well-characterized and lab-grown 
formulations of microbes. 

These blends of cultured strains are typi- 
cally selected on the basis of observational 
human studies and mouse experiments 
that test which organisms most influence 
the response to immunotherapy. Wargo, 
for example, led one of anumber of groups 
that described correlations between clinical 
responses to checkpoint inhibitors and the 
composition of the gut microbiome. Micro- 
biomics company Seres Therapeutics in Cam- 
bridge, Massachusetts, took those findings, 
incorporated extra in-house data, and created 
a mix of strains from dozens of bacterial spe- 
cies, allinspore form. Researchers, including 
Wargo, have began testing the Seres productin 
people with advanced-stage melanoma. 

Microbial therapeutics company Vedanta 
Biosciences, also in Cambridge, picked 
11strains for its bacterial cocktail by looking for 
microbes in human faeces that most potently 
elicited the desired immune responses in 
mice. A team including Vedanta’s scientists 
showed how each strain in isolation could 
enhance antimicrobial or antitumour immu- 
nity in mouse models”. “However, assembled 
in certain consortia, they had a much larger 
effect,” says study co-author and Vedanta chief 
executive Bernat Olle. 

Some firms, including pharmaceutical com- 
pany 4D Pharmain Leeds, UK, are paring down 
the therapeutic approach even further and 
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administering single microbial strains with 
immune-stimulating effects. The company’s 
scientists have described a strain of Enterococ- 
cusgallinarum, isolated froma healthy human 
gut, andits structural protein flagellin, which 
rouses the immune system by interacting with 
areceptor found on intestinal cells. The firm is 
now testing that strain in the clinic in people 
with lung, kidney, bladder and skin cancers, 
both as a therapy ahead of surgical removal 
of the tumours and in combination with a 
checkpoint inhibitor. 


Poo versus pills 


Bryan Coburn, an infectious-disease special- 
ist at the Toronto General Hospital Research 
Institute in Canada, points to several bene- 
fits of using rationally designed consortia of 
bacteria rather than relying on donor faecal 
material. “There are specific safety advan- 
tages, because we know exactly what’s going 
in,” says Coburn, who is clinically evaluating 
a multi-strain pill for cancer from NuBiyota, 
which is based in Pearl River, New York, and 
co-founded by Guelph’s Allen-Vercoe. Pre- 
pared formulations are scalable and modifi- 
able, Coburn says. Moreover, “we can assess 
things like potency, which you can’t do easily 
with faecal transplants”, he adds. 

Microbial therapies might also need to be 


“People are really embracing 
the idea and we're beginning 
tosee the early fruits of 

that labour.’ 
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tailored to certain tumour types. For example, 
men with metastatic prostate tumours who 
responded to checkpoint inhibition have been 
found to have lower levels of amicrobe called 
Akkermansia muciniphila in their stool than 
did men who did not respond. But the oppo- 
site is true of people with lung and kidney 
cancers — those with more A. muciniphila in 
their guts tended to fare better on the therapy. 

Amy Moran, an immunologist at Oregon 
Health and Science University in Portland, 
thinks that the different treatments that 
people with different cancers receive might 
explain the discrepancy. “So many other 
types of drugs that these patients take might 
be impacting the composition of the micro- 
biome,’ she says. She suspects that, for pros- 
tate cancer, the hormone therapies commonly 
used as first-line treatments might be to 
blame. Uncertain of which bacteria will be best 
in this context, Moran and her colleagues are 
starting by trying to boost immunotherapy 
response with complete faecal transplants. 

Elsewhere in the oncology clinic, research- 
ers are turning to microbiome therapeutics to 
manage some of theimmune-related toxicity 
associated with checkpoint-blocking drugs. At 
the University of Texas MD Anderson Cancer 
Center, gastroenterologist Yinghong Wang 
is using faecal transplants to manage cases of 
immunotherapy-induced colitis. In 2018, she 
described how a woman with bladder cancer 
andamanwith prostate cancer, both of whom 
developed side effects including bloody diar- 
rhoea after receiving checkpoint inhibitors, 
saw their symptoms resolve after one or two 
transplants of stool from a healthy donor". 
Wang has since treated another dozen or so 
people. “All of them seem to benefit from this 
treatment,” she says. 

None of Wang’s patients had previously 
received faecal transplants to improve thera- 
peutic responses. If they had, she suspects they 
would not have developed the side effects in 
the first place. Microbiome modulation might, 
therefore, offer a double benefit for people 
with cancer — enhancing response rates to 
other drugs while also guarding against the 
worst of their ill effects. 


Elie Dolgin is a science journalist in 
Somerville, Massachusetts. 
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The gut microbiome 
outlook 


Perspective: 
Another dimension 
for drug discovery 


‘Deep phenotyping’ of 
human cohorts, including 

the collection of microbiome 
data, could transform therapy 
development, says Eran Segal. 


ast year, drug company Novartis began charging 

US$2.1 million per patient for its new spinal mus- 

cular atrophy treatment — breaking the record for 

the world’s most expensive drug. Withso many com- 

pounds failing to reach the clinic, and the cost of 
turning a molecular entity into a therapy reaching billions, 
it’s no wonder that drugs have become so expensive. But 
we can do better. 

One way to reduce costs is to use genetic data to inform 
drug design. Genetic information helps researchers to 
demonstrate that drug targets are relevant to the disease 
from the start, and drugs with this evidence are twice as 
likely to be approved as those without (M. R. Nelson etal. 
Nature Genetics 47, 856-860; 2015). But we can further 
optimize drug discovery. If we start with a ‘deep’ molec- 
ular profile that includes data about the microbiome and 
genome, as well as information about metabolites and pro- 
teins (the metabolome and proteome), coupled with phys- 
iological measurements, we might be able, insome cases, 
to skip animal testing and move straight to human trials. 

The ability to start drug discovery with this in-depthinfor- 
mation is particularly useful for finding microbiome-re- 
lated therapies. Rather than focus on microbial targets 
with therapeutic value in animal models that might turn 
outto be rare inthe human microbiome, or that act through 
different mechanisms in people, we can start with microbes 
associated with human disease. Interventions to modify 
bacteria can also start directly in people. The idea is to 
collect dietary and microbiome data from many individ- 
uals, derive models of how diet affects composition of the 
microbiome, and then validate the models with controlled 
dietary interventions. With more than 5 million bacterial 
genes, the microbiome represents a prolific reservoir of 
modifiable targets with potentially therapeutic effects. 

The gut microbiome has been implicated in numerous 
conditions, including autoinflammatory disease, autism, 
cardiovascular disease and cancer. Growing evidence from 
animal and human studies suggests that it has causal effects 
in disease, such as by regulating host gene expression or 
by producing metabolites that circulate inthe blood. And 
because the microbiome is predominantly shaped not by 
genetics but by modifiable environmental factors suchas 
diet, this presents an opportunity to intervene. 


profiling 
allows the 
disease state 
or treatment 


response 
tobe 
modelled asa 
continuum.” 
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For example, dietary interventions are being targeted 
towards gut bacteria that synthesize the metabolite tri- 
methylamine N-oxide (TMAO). It has been suggested that 
elevated plasma levels of TMAO cause cardiovascular dis- 
ease; reducing TMAO production could, therefore, help to 
lower disease risk. And in neurological diseases, microbi- 
ome-derived metabolites that reach intestinal neurons 
could provide a means of getting metabolites through the 
barrier that separates the brain from circulating blood. 

The microbiome can also affect the efficacy of pharma- 
ceutical drugs. Bacterial enzymes, for example, can metab- 
olize the Parkinson’s disease drug L-DOPA, and gut bacteria 
canaffect a person’s response to cancer immunotherapy. 
Dietary changes targeting bacteria that interfere with drug 
metabolism could, therefore, be effective supplements to 
existing treatments. The regulatory path for approving 
such microbiome-nutrition interventions is much easier 
than for conventional pharmaceutical products. 

The development of microbiome-related therapeutics 
faces many challenges, including the need to establish 
causal mechanisms. However, even if these are unknown, 
we might still be able to use human microbiome data to 
devise therapies. For example, our team has targeted post- 
meal blood glucose levels, which are important in obesity 
and diabetes. We tracked blood glucose levels in 900 peo- 
ple, and collected data about their microbiome, genetics, 
metabolomics, diet and lifestyle (D. Zeevi et al. Cell163, 
1079-1094; 2015). We found that people respond differ- 
ently to the same meal, and devised a machine-learning 
algorithm that accurately predicted these personalized 
responses from clinical and microbiome data. In short- 
term and 12-month randomized controlled trials, we 
showed that personalized dietary interventions based on 
the algorithm successfully balanced glucose levels in peo- 
ple with higher than normal blood sugar — outperforming 
the standard-of-care diet. 

We need new approaches to drug development. 
Cohorts made up of rich molecular and physiological 
profiles from many volunteers offer one way to prioritize 
human-relevant targets for development. These cohorts 
must be constructed carefully — the type and depth of 
the data collected should be relevant to the disease being 
studied. Having multiple types of dataon the same people 
can be powerful for defining more exact targets and for 
discovering novel disease biomarkers and drug targets. 
Deep profiling also allows the disease state or treatment 
response to be modelled as acontinuum, avoiding the need 
for arbitrary thresholds that categorize people as respond- 
ers or non-responders, for example. This might lead to 
better estimates of disease risk and better prioritization 
of people to treatments and randomized controlled trials. 

Longitudinal measurements of the same people are also 
crucial. Such studies bypass confounding factors of inter- 
personal variability because the volunteers serve as their 
own control. Finally, because resources are always lim- 
ited, cohort size is an important consideration. A study of 
thousands or tens of thousands of participants still allows 
longitudinal, deep molecular profiling, but keeps costs 
realistic. Now is the time to use deep cohorts to end the 
era of laborious, costly, risky and time-consuming drug 
discovery. We can’t afford another world record. 
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Aresearcher tests donor stool before it is used for faecal microbiota transplantation. 


Engineering the microbiome 


Modified bacteria and carefully formulated microbial communities 
could form the basis of new living treatments. By Claire Ainsworth 


Ithough it happened almost a decade 
ago, Willem de Vos still vividly remem- 
bers his colleagues being told to halt 
the clinical trial they had been run- 
ning. De Vos was part of the team 
conducting the first randomized clinical trial 
of faecal microbiota transplantation (FMT) 
— faeces from healthy donors were used as a 
last-resort treatment for people with a dev- 
astating, recurrent gut infection caused by 
the bacterium Clostridium difficile. About a 
year in, the data and safety monitoring board 
overseeing the trial had seen enough: the 
trial needed to end. But it wasn’t because the 
therapy didn’t work — quite the opposite. The 
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transplants were proving so successful that 
it was no longer ethical to continue to give 
people in the control group the conventional 
antibiotic treatment with which the trans- 
plants were being compared. “That showed 
us that it worked and why it worked,” says 
de Vos, a microbiologist at the University of 
Wageningen in the Netherlands and the Uni- 
versity of Helsinki in Finland. The antibiot- 
ic-treated patients who relapsed were given 
the transplant, which cured them. 

The C. difficile story is one of agrowing list of 
examples of how the gut microbiome shapes 
our biology. The community of microbes that 
lives in the gut has been associated with many 
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aspects of our physiology — from conditions 
such as obesity to how the immune system 
functions and even mental health. The suc- 
cess of FMT in treating C. difficile also shows 
that, in principle, the ecology of the gut can be 
manipulated to treat disease. Now, scientists 
are attempting to engineer gut microbiota that 
will allow them to do just that. 

Synthetic biologists are working at the level 
of individual species, engineering gut bacteria 
not only to deliver therapeutic payloads but 
also to monitor and respond to conditions 
inside the body. Meanwhile, synthetic ecol- 
ogists are looking at the gut as an ecosystem 
and assembling communities of microbes that 


LEWIS HOUGHTON/SPL 


interact to produce substances or behaviours 
for medical benefit. Both approaches are in 
their infancy, and there are challenges to get- 
ting them into the clinic. Yet the technologies 
are already proving to be powerful tools, allow- 
ing scientists to explore the complex microbial 
interactions in our internal ecosystem. 


Bespoke bacteria 


Engineering individual microbes has an 
impressive array of potential applications. 
Gut bacteria have been altered to produce 
therapeutic molecules to treat metabolic con- 
ditions, kill pathogens and trigger immune 
responses to cancers. A strain of Escherichia 
coli engineered to produce the proteins 
needed to correct rare metabolic deficien- 
cies is now in clinical trials. And in 2018, a 
team in Singapore revealed gut bacteria that 
it had engineered to stick to colon cancer cells 
and secrete an enzyme that converts a sub- 
stance naturally found in vegetables such as 
broccoliinto a molecule that inhibits tumour 
growth. When given to mice with coloncancer, 
the treatment shrank tumours and reduced 
recurrence’. Bacteria can even be engineered 
to sense signs of disease and respond by pro- 
ducing therapeutic molecules. For example, in 
2017, researchers took a gut bacterium com- 
monly used asa probiotic and gave it the ability 
to detect communication signals produced 
by pathogenic bacteria. The probiotic bacte- 
rium then produces an antimicrobial molecule 
in response. The researchers showed that it 
helped clear infections in worms and mice’. 

Studies such as this show the potential of 
live therapeutics, but so far the engineered 
bacteria are comparatively straightforward 
systems — they produce a therapeutic mole- 
cule either at aconstant rate or inresponseto 
an environmental signal. Now, researchers are 
looking to broaden the scope of engineered 
microbes and engineer bacteria with DNA con- 
taining more complex elements designed to 
work like electronic circuits. This is the realm 
of synthetic biology, a discipline that aims to 
apply engineering principles — suchas stand- 
ardized, modular components — to biological 
systems. 

These complex feats of engineering are 
allowing bacteria to do simple computational 
tasks, suchas remembering a one-off stimulus 
long after it has passed. For example, a team 
of synthetic biologists led by Pamela Silver at 
the Wyss Institute for Biologically Inspired 
Engineering at Harvard University in Bos- 
ton, Massachusetts, engineered a bacterium 
to detect a chemical produced by inflamed 
gut cells. In response, the bacteria secrete a 
molecular signal, and continue to secrete it 
even if the gut inflammation dies down. The 


signal can be detected in stool samples, rais- 
ing the possibility of using the bacterium asa 
living diagnostic test for inflammatory bowel 
disease — which is often transient in nature 
and, therefore, hard to detect in the clinic. 
The bacteria formed a stable colony in the 
guts of mice for six months and responded to 
experimentally induced gut inflammation’. 
Importantly, engineered bacteria that can 
remember other kinds of environmental signal 
would allow researchers to explore conditions 
in different regions of the gut — something that 
is hard to do with conventional stool samples. 
“What we really would like is the bacteria to be 
like detectives and tell us what’s going on as 
they pass through,” says Silver. 

Getting a genetic circuit to work in the lab 
is hard enough. Translating that to the messy, 
competitive environment of the gut microbi- 
ome presents an even greater challenge. Any 
modification that imposes an extra burden 
— say, extra protein production — ona bacte- 
rium puts it at a disadvantage, resulting in that 
organism either being out-competed or ditch- 
ing its engineered function to survive. Partly 
for this reason, researchers have struggled to 
get many engineered bacteria to make the leap 
from test tube to animal models. Scientists 
are now working on ways around this; Silver, 
for example, is using genetic elements that 
naturally place a minimal burden on the cell. 

The final hurdle will be showing that engi- 
neered bacteria are effective and safe. What’s 
more, unlike conventional drugs, engineered 
bacteria could spread into the environment 
and share their DNA with other bacteria. 
Although the chances of them surviving in 
the wild are thought to be low, the possibility 
of unforeseen consequences (not to mention 
the need to secure public acceptance and regu- 
latory approval) has led researchers to explore 
a number of options to contain engineered 
bacteria, including kill switches that force 
bacteria to kill themselves with a toxin if their 
engineered circuits turn faulty or ifthey leave 
the body. 


Constructing communities 

While some researchers engineer individual 
bacteria, others are turning their attention to 
groups of microbes. Just as a city functions 
as a result of many people doing different 
jobs, the gut is a hive of interactions between 
myriad microbes carrying out different func- 
tions. Some interactions are metabolic — one 
bacterium might produce something that 
another consumes, for instance. Others 
are ecological, such as when one microbe 
inhibits the growth of another. By working 
together, communities of microbes produce 
molecules or behaviours that would not arise 


© 2020 Springer Nature Limited. All rights reserved. 


from organisms acting alone. 

These emerging properties of the gut micro- 
biome havea profound effect on our biology, 
such as by producing vitamins or molecules 
that modulate our immune responses. To 
understand these interactions and to devise 
newtherapies, researchers are building com- 
binations of different bacteria known as syn- 
thetic ecosystems. For the most part, these 
ecosystems are made up of naturally occurring 
bacterial strains, although some scientists are 
experimenting with ecosystems containing 
genetically engineered microbes. 

From atherapeutic point of view, synthetic 
ecosystems have anumber of potential advan- 
tages. FMT currently relies on faecal matter 
provided by donors. Stool samples contain 


“What we really would 
like is the bacteria tobe 
like detectives and tell us 
what’s going on.” 


highly complex mixtures of microbes that 
vary from donor to donor, and each must be 
screened for pathogenic microbes. If FMTs 
could be stripped down to just the key spe- 
cies needed to treat people, simplified path- 
ogen-free mixtures of these selected microbes 
could be grown in the lab. Synthetic commu- 
nities would offer a standardized therapeutic 
with aknown composition, and would lift the 
reliance on finding suitable donors. 

Research, including a few studies in people, 
suggest that this approach could work. Mix- 
tures of selected bacteria isolated from stool 
samples have shown promise in treating peo- 
ple with C. difficile. And it’s not just infections 
that could be tackled, but also conditions such 
as inflammatory bowel disease. In 2013, ateam 
led by scientists in Japan identified acommu- 
nity of human gut microbes that could pro- 
mote the activity of inflammation-damping 
immune cells called regulatory T cells, and 
showed that this could ameliorate inflamma- 
tory bowel disease in mice’. As well as devel- 
oping therapies, stripping down conventional 
FMTs is allowing scientists to work out which 
bacteria in stool transplants are exerting a 
therapeutic effect — something that de Vos and 
his colleagues are exploring in conditions such 
as inflammatory bowel disease and metabolic 
syndrome. 

One drawback of this stripping-down 
approachis that it limits the applications of the 
synthetic community to functions that already 
exist. There might be situations in which you 
would want to create a community witha 
new function, such as producing a vitamin 
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Synthetic biologists have engineered bacteria that remember the presence of a chemical and 
secrete a molecular signal that allows them to be identified. 


or degrading a toxin. Creating new functions 
requires designing from the bottom up — 
testing different combinations of microbes, 
including those that don’t normally co-exist 
in nature, until one gives the desired outcome. 
Doing this by trial and error inlab experiments 
soon becomes unwieldy, so instead research- 
ers have turned to computer modelling. 

The aim here is to predict the emergent 
properties ofa microbial community, based on 
expected interactions between the microbes 
present. A team led by Elhanan Borenstein at 
Tel Aviv University in Israel created computer 
models of the metabolic reactions inside 
individual microbes, and then modelled how 
these would behave in the presence of another 
microbe’s metabolism*. By simulating interac- 
tions between pairs of microbes, they showed 
how new metabolic products emerged that 
wouldn't be seen if the microbes acted alone. 
Models can simulate ecological interactions 
too, suchas how the abundance of one microbe 
affects the abundances of others. This can help 
scientists to design microbial communities 
that are stable and therefore persist over time. 


It’s the ecology 

Computer modelling and lab-grown 
communities allow researchers to gain a bet- 
ter understanding of how microbes in natu- 
ral communities in the gut interact — both 
with each other, and with their human hosts. 
De Vos’s team grew four different bacteria 
that usually live together in the mucus layer 
lining the gut®. One species, Akkermansia 
muciniphila, breaks the mucus down into 
compounds that other bacteria consume. The 
team showed that the other bacteria were not 
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only consuming these compounds, but were 
also feeding molecules they had made back to 
A. muciniphila and, in the case of butyrate, a 
fatty acid needed by the cells of the gut lining, 
to their host. 

Researchers are also gaining new insights 
into the relationships between microbes and 
between microbes and their host from the cre- 
ation of minimal microbiomes — constructed 
microbial communities containing the small- 
est number of species needed to create a stable 
ecosystem. A 2016 study showed how combin- 
ing a minimal microbiome with comparative 


“We're making progress and 
learning alot of interesting 
biology onthe way.” 


genomics can lead to the design of a microbial 
community with a desired property. Barbel 
Stecher at the Ludwig-Maximilians University 
of Munichin Germany and her team developed 
the Oligo-MM” minimal microbiome — a col- 
lection of 12 gut microbes that helps to pre- 
vent Salmonella enterica from colonizing the 
guts of mice lacking any bacteria of their own’. 
The 12 bacterial species excluded Salmonella 
almost, but not quite, as wellas a conventional 
microbiome. By using genomics to compare 
their minimal microbiome witha complex one, 
the researchers singled out the ecosystem 
functions that were missing from their com- 
munity, added three more bacterial species 
that could fill the gap, and produced a com- 
munity that was as good as the conventional 
one at keeping Salmonella out. Ultimately, 
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researchers hope that studies suchas this will 
allow them to design minimal microbiomes 
with defined therapeutic properties, such as 
producing butyrate or vitamins. 

Perhaps the eventual application of micro- 
biome engineering would be to combine 
synthetic biology and synthetic ecology. 
Scientists would create communities con- 
taining genetically-engineered microbes, the 
collective behaviour of which would deliver 
a therapeutic benefit. One advantage of this 
approach is that it would let engineers dis- 
tribute different metabolic tasks between 
different bacteria. This means all the physi- 
ological stress of making a drug ora vitamin 
would not be placed on just one bacterium. 
A number of teams have made progress in 
this area, including exploiting a system that 
bacteria use to detect the presence of other 
bacteria and to modify their gene activity in 
response. Researchers are using this feature, 
known as quorum sensing, to control the 
behaviour of mixed populations of bacteria, 
to, for example, allow bacteria that compete 
with each other to co-exist and forma stable 
population. 

The potential paybacks of engineering the 
gut microbiome are immense, but so are the 
challenges to reaching this goal. Of all the 
human microbiomes to take on, the gut micro- 
biome is by far the largest and most complex. 
Muchremains to be learnt about its denizens, 
their genes and their interactions. And that’s 
before you get started on what the human host 
brings to the party. Indeed, there is so much 
variation between individuals that it’s still not 
clear what a ‘healthy’ gut microbiome looks 
like (see page S6). 

Even so, the potential payoffs are motivating 
the scientists to aim high. Borenstein hopes 
one day to take information about an individ- 
ual — the microbes in their gut, their physiol- 
ogy, their diet and their genome — and use it to 
build a full-scale computer model of their gut 
microbiome. Such an advance might make it 
possible to design personalized interventions 
to treat or prevent disease. 

“This is not something we'll get to ina 
year, or two or five,” Borenstein admits. “But 
we're making progress and learning a lot of 
interesting biology on the way.” 


Claire Ainsworth is a freelance science 
journalist in Hampshire, UK. 
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Perspective: 
Use dietas atool, 
notatreatment 


Scientists can Sequence your 
microbiome, but they still 
can’t tell you what to eat to 
prevent or treat disease, says 
Peter J. Turnbaugh. 


recent report! about the health consequences 

of eating red and processed meat renewed 

long-standing debates about what evidence 

should be required before the public are told 

what foods they should avoid. For me, this subject 
hits especially close to home — my lab’s research shows 
that meat consumption alters the human and mouse gut 
microbiomes, and 1 am frequently asked if meat is “bad 
for gut microbes”. 

Our results indicate that meat consumption promotes 
the growth of bacteria that exacerbate mouse models of 
inflammatory bowel disease and decreases the levels of 
bacteria that metabolize fibre’. But our studies in both 
people and mice have been short, controlled and extreme 
interms of the level of meat consumption. Does this really 
reflect what happens in people on more typical diets? 

Basing dietary advice on microbiome studies also 
assumes that it is possible to predict the health effects of 
different microbial communities — a goal that is far from 
being realized. Even well-studied mechanisms have unclear 
consequences for human health. Take, for example, the 
production of short-chain fatty acids (SCFAs) from the bac- 
terial digestion of fibre. SCFAs act on multiple tissues and 
targets of interest in both bacterial and host cells. Further- 
more, not all SCFAs are the same, and they have not been 
considered in the context of products of carbohydrate 
and amino-acid fermentation — let alone the diversity of 
other microbial metabolites that could enhance or coun- 
teract the effects of these specific compounds. Therefore, 
although it is tempting to recommend diets that boost 
SCFAsto prevent metabolic and other diseases, the broader 
effects on the microbiome and its complex interactions 
with the host are hard to predict. 

Another massive gap in our knowledge is the degree 
to which the microbiomeis affected by, or mediates, the 
health consequences of our diet beyond macronutrients 
(fats, proteins and carbohydrates). Results from our lab 
have shown that the effect of raw potatoes on the gut 
microbiome is markedly different from that of cooked 
ones’. Although cooking-induced changes to carbohy- 
drates explained much of this effect, our data also sug- 
gest the broader chemical diversity of plants should be 
considered. Other research has highlighted the ability of 
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gut bacteria to inactivate potentially harmful chemicals 
formed during cooking*. These results make clear that 
researchers studying the interaction between diet and the 
microbiome need to consider not just the composition 
of foods, but also how they are prepared. 

The research questions pertaining to diet and the gut 
microbiome are wide open. It is all the more important, 
therefore, that we identify reliable ways to use the data 
that emerge as we improve our understanding of how the 
microbiome works. 

Two clear strategies for using microbiome data can be 
borrowed from more conventional drug development: 
biomarker discovery and target-driven screening. The use 
of gut microbes as biomarkers was pioneered by Eran Segal 
at the Weizmann Institute of Science in Rehovot, Israel, (see 
page S19) whose group used microbiome profiles alongside 
other data to predict blood glucose levels following a meal. 
Although the added value of microbiome data for that par- 
ticular application remains debated, the general limitation 
of these approaches is that they provide little information 
about the mechanisms involved. That makes it difficult to 
infer causal relationships, to develop mechanistic hypoth- 
eses, or to expand these predictors to incorporate future 
discoveries about the host or microbiome. 

The other approach, selecting specific bacterial targets 
for modulation by diet, has been applied to the design of 
diets to treat undernourished children®. Prototype diets 
were designed with the goal of promoting the growth 
of bacteria that are under-represented in children with 
severe acute malnutrition. Following extensive preclinical 
characterization of these microbes in mice and pigs, the 
researchers ran a randomized, double-blind controlled 
feeding study that provides preliminary support for effi- 
cacy in people. This target-driven approach offers aclear 
advantage over biomarker discovery — it is easy to see 
how future mechanistic insights into the components of 
the microbiome with compelling links to disease can be 
incorporated into the development of therapeutic diets. 

Asthe amount of research into the interactions between 
diet and the microorganisms that populate our gut grows, 
it is worth considering whether dietary recommendations 
are the best way to use our growing knowledge about the 
role of the microbiome in nutrition. Adjusting the type or 
quantity of food eatenis an attractive intervention, given 
its simplicity, but a restrictive diet can be difficult to main- 
tain over long periods of time — especially for people with 
severe disease. In the future, researchers might instead 
use diet as a discovery platform in humans and animal 
models to uncover specific species, genes or enzymes 
that could be targeted using conventional small molecule 
drugs, biologics or cell-based therapies. If so, rather than 
‘what should people eat?’ maybe the question should be 
‘how can we design the microbiome-based medicines of 
the future?’. 
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Research round-up 


Highlights from 
microbe studies. 
By Liam Drew 


The gut’s link to 
mental health 


Astudy ofa pair of 1.000-strong 
cohorts has strengthened the 
link between the community of 
microorganisms that live in the 
gut and mental health. 

Jeroen Raes at the Catholic 
University of Leuven, Belgium, 
and his team initially looked for 
links between the microbiome 
and depression and quality of 
life in participants in the Belgian 
Flemish Gut Flora Project. In 
this cohort, the team showed 
that two bacterial species were 
positively correlated with 
self-reported high quality of 
life, whereas a third was most 
abundantin people reporting 
low quality of life. A subsequent 
analysis that categorized people 
as having one of four types of 
microbiome found that people 
with depression more often 
had a type associated with low 
overall bacterial abundance. 
Using published genetic 
characterizations of gut-bacterial 
metabolic pathways, the team 
also showed around 50 routes by 
which various intestinal microbes 
can produce neuroactive 
metabolites. Among the 
pathways, it found an association 
between higher quality of life 
and the presence of bacteria 
that produce a metabolite of the 
neurotransmitter dopamine. 

The study frames hypotheses 
for future work to investigate 
whether the microbiome can 
affect mental health — and, if so, 
whether this happens through 
altered dopamine signalling. 


Nature Microbiol. 4, 623-632 
(2019) 


Villi of the small intestine are home to gut microbes. 


The key to faecal 
transplantation 


A follow-up analysis ofa 
landmark clinical trial of faecal 
microbiota transplantation 
(FMT) for ulcerative colitis 
might have identified bacterial 
species that could help to treat 
this form of inflammatory bowel 
disease. In a 2017 trial, 27% of 
people with ulcerative colitis 
entered full remission after 
receiving FMT — asuccess, but a 
partial one. 

Now, Nadeem Kaakoush at the 
University of New South Wales 
in Sydney, Australia, and his 
colleagues report on detailed 
analyses that link remission 
to features of the recipients’ 
intestinal microbiomes before 
and after treatment, and also 
to the bacterial composition of 
donor stool. 

People whose disease went 
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into remission had greater 
overall microbiota diversity both 
pre- and post-FMT. But following 
treatment, remission was 
associated with the enrichment 
of two species of bacteria: 
Eubacterium halliiand Roseburia 
inulinivorans. These microbes 
are thought to boost production 
of short-chain fatty acids 
(SFCAs) and the breakdown 
of starch. Indeed, the guts of 
people in remission showed an 
increase in SCFA production. 

The likelihood of successful 
treatment also correlated with 
the bacterial content of donated 
stool. Remission was more likely 
if the stool had high levels of 
Bacteroides species, whereas 
Streptococcus species were more 
common in samples that did not 
induce remission. 

The results could allow 
rational selection of both 
donors and recipients of FMT 
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for ulcerative colitis, and might 
make it possible for physicians 
to use particular bacterial 
strains to enhance responses or 
treat ulcerative colitis. 


Gastroenterology 156, 1440-1454 
(2019) 


Microbe boosts 
metabolic health 


According to a clinical trial, 

a daily dose of the bacterium 
Akkermansia muciniphila might 
treat metabolic syndrome — a 
condition that predisposes 
individuals to type 2 diabetes 
and serious cardiovascular 
disease, and that is marked by 
obesity, high blood pressure, 
and raised levels of blood 
sugar, fats and cholesterol. 

A. muciniphila is abundant in 
the guts of lean people and 


THOMAS DEERINCK, NCMIR/SPL 


its prevalence decreases with 
obesity. Ina proof-of-principle 
study, Belgian researchers 

gave people who were insulin- 
resistant and overweight or 
obese a preparation of either live 
or pasteurized A. muciniphila, or 
a placebo, daily for three months. 

Both the live and pasteurized 
bacteria had beneficial effects. 
Pasteurized A. muciniphila in 
particular lowered circulating 
insulin and total cholesterol 
levels and decreased insulin 
resistance. The microbes also 
reduced white blood cell counts, 
an indication that there was 
less overall inflammation. It 
is unclear why dead bacteria 
were more effective, but 
the results suggest that the 
bacteria’s metabolites or 
cell-wall fragments might be 
therapeutically active. 

Patrice Cani at the Catholic 
University of Louvain, Louvain- 
la-Neuve, Belgium, and his team 
note the study was small, with 
only about ten people per group, 
and neither abdominal fat nor 
body mass index were reduced. 
They are now looking at the 
effects of individual molecules 
in mice andare planning a larger 
trial of pasteurized A. muciniphila 
in humans. 


Nature Med. 25, 1096-1103 (2019) 


Bacterial effect on 
disease pathways 


Investigations of links between 
the gut microbiome and specific 
diseases tend to have a causality 
problem: it’s unclear whether an 
altered microbiome contributes 
to the disease or whether the 
disease alters the collection 

of microbes. But a team of 
researchers led by Serena Sanna 
and Cisca Wijmenga at the 
University of Groningen in the 
Netherlands and Mark McCarthy 
at the University of Oxford, UK, 
have addressed this issue using 
an analytical approach called 
Mendelian randomization — 

in which genetic variants are 
treated as manipulations that 


define experimental groups — to 
show that the microbiome can 
cause metabolic dysfunction. 
The team also suggested a 
mechanism by which this 
occurs: changes in microbial 
production of short-chain fatty 
acids (SCFAs). 

The metabolic health of nearly 
1,000 people in the Netherlands 
was found to correlate with the 
presence of certain bacteria and 
bacterial metabolic pathways. 
Most notably, higher faecal 
levels of butyrate — an SCFA 
produced by gut microbes 
— predicted better insulin 
responses. The study also 
showed that participants’ genes 
partially predict the level of 
butyrate production in people’s 
guts, and microbiome structure 
more broadly — a finding 
confirmed ina different group 
of more than 4,000 individuals. 

Then came the test of 
causality. If altered insulin 
sensitivity changes the 
microbiome (rather than the 
microbiome disrupting insulin 
physiology), all genetic factors 
known to influence insulin 
sensitivity should also predict a 
person’s butyrate production. 
But they did not. This suggests 
that genes associated with both 
microbiome structure and 
insulin responses influence gut 
microbiomes, which in turn 
disrupt insulin signalling. 

The team also found that 
genetic variants elevating faecal 
levels of another bacteria- 
derived SCFA, propionate, 
increase the risk of type 2 
diabetes. These findings pave 
the way for more personalized 
treatments of metabolic disease. 


Nature Genet. 51, (00-605 (2019) 


Childbirth andthe 
microbiome 


Aperson’s gut microbiome is 
seeded in early life according to 
the bacteria they are exposed 

to and howsuccessfully these 
microbes colonize the intestines. 
Research, led by Trevor Lawley 


at the Wellcome Sanger Centre 
in Hinxton, UK, and Nigel Field 
at University College London, 
shows that a caesarean-section 
birth radically affects newborns’ 
microbiomes, and that infants 
born this way are often host to 
opportunistic hospital bacteria. 

Looking at 596 healthy babies 
shortly after birth and again at 
8-10 months old, the researchers 
showed that when babies were 
4 days old, the microbiomes of 
those delivered by c-section were 
markedly different from those 
of babies born vaginally. And 
although the microbiomes of 
c-section infants gradually shifted 
closer to those of vaginally born 
babies over the first three weeks 
of life, significant differences per- 
sisted into infancy. 

One of the most pronounced 
differences in the microbiomes 
of c-section babies was alow 
abundance of Bacteroides. 

Levels of bacteria commonly 
found in hospitals in the 
microbiomes of c-section 
newborns did decrease sharply 
in the months after birth, but 
those organisms were still 
slightly more common than in 
vaginally born babies at around 
eight months. The authors 
attributed the differences to 
infants born by c-section having 
less exposure to the maternal 
microbiome during birth. 

But the study also found that 
when mothers who gave birth 
vaginally were given prenatal, 
prophylactic antibiotics — which 
are also given to nearly all 
women who have c-sections — 
their babies’ microbiomes were 
also low in Bacteroides bacteria. 

Both c-section delivery and 
antibiotic exposure have been 
implicated in the development 
of childhood allergies. Although 
the study does not provide 
direct evidence that an altered 
microbiome is the mechanistic 
link between these events and 
illness, it does raise the question 
of whether a drastically different 
initial gut microbiome has long- 
term consequences. 


Nature 574, 117-121 (2019) 


Weightloss without 
microbial gains 


Frenchscientists have examined 
the microbiomes of people 

with severe obesity, and how 
the communities of microbes 
changed after bariatric surgery. 
The research, by Karine Clément 
at Sorbonne University, Paris, 
and her co-workers, showed that 
most people who are severely 
obese have impoverished 

gut microbiomes — and that 
post-surgical weight loss and 
improved metabolic health 
were not accompanied by a full 
recovery of microbial diversity. 

Microbiomes were analysed 
by quantifying the overall 
genomic diversity and 
measuring the serum levels 
of microbiome-associated 
metabolites before and after 
surgery. Three-quarters of 
participants had low microbial 
gene richness — compared with 
20-40% of people who were 
moderately obese. When the 
authors looked for correlations 
between metabolites and 
microbiome structure, they 
found nine metabolites were 
affected by changes to the 
microbiome. 

Bariatric surgery — either 
a gastric band or a bypass — 
increased microbial diversity by 
25-40%, but, on average, levels 
remained lower than those of 
people of a healthy weight. This 
held true one year after surgery, 
when weight loss was maximal. 
Insome people, the comparative 
lack of diversity persisted five 
years after surgery. 

The study suggests that 
applying strategies to correct 
the gut dysbiosis associated 
with severe obesity, alongside 
surgery, could further improve 
metabolic function. 


Gut 68, 70-82 (2019) 
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Multi-omics of the gut microbial 
ecosystem in inflammatory bowel diseases 
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Inflammatory bowel diseases, which include Crohn’s disease and ulcerative colitis, affect several million individuals 
worldwide. Crohn’s disease and ulcerative colitis are complex diseases that are heterogeneous at the clinical, 
immunological, molecular, genetic, and microbial levels. Individual contributing factors have been the focus of 
extensive research. As part of the Integrative Human Microbiome Project (HMP2 or iHMP), we followed 132 subjects 
for one year each to generate integrated longitudinal molecular profiles of host and microbial activity during disease 
(up to 24 time points each; in total 2,965 stool, biopsy, and blood specimens). Here we present the results, which provide 
a comprehensive view of functional dysbiosis in the gut microbiome during inflammatory bowel disease activity. We 
demonstrate a characteristic increase in facultative anaerobes at the expense of obligate anaerobes, as well as molecular 
disruptions in microbial transcription (for example, among clostridia), metabolite pools (acylcarnitines, bile acids, and 
short-chain fatty acids), and levels of antibodies in host serum. Periods of disease activity were also marked by increases 
in temporal variability, with characteristic taxonomic, functional, and biochemical shifts. Finally, integrative analysis 
identified microbial, biochemical, and host factors central to this dysregulation. The study’s infrastructure resources, 
results, and data, which are available through the Inflammatory Bowel Disease Multi’ omics Database (http: //ibdmdb. 
org), provide the most comprehensive description to date of host and microbial activities in inflammatory bowel diseases. 


Inflammatory bowel diseases (IBD) affect more than 3.5 million 
people, and their incidence is increasing worldwide!. These diseases, the 
most prevalent forms of which are Crohn’s disease (CD) and ulcerative 
colitis (UC), are characterized by debilitating and chronic relapsing 
and remitting inflammation of the gastrointestinal tract (for CD) or 
the colon (in UC). These conditions result from a complex interplay 
among host”?, microbial*~®, and environmental’ factors. Drivers of 
IBD in the human genome include more than 200 risk variants, many 
of which are responsible for host-microbe interactions’. Common 
changes in the gut microbiome in individuals with IBD include an 
increase in facultative anaerobes, including Escherichia coli®, anda 
decrease in obligately anaerobic producers of short-chain fatty acids 
(SCFAs)*. Here, to support a systems-level understanding of the aetiol- 
ogy of the IBD-associated gut microbiome that goes beyond previously 
reported metagenomic profiles, we introduce the IBDMDB, as part of 
the Integrative Human Microbiome Project. 

We recruited 132 participants from five academic medical cen- 
tres (three paediatric sub-cohorts: Cincinnati Children’s Hospital, 


Massachusetts General Hospital (MGH) Pediatrics, and Emory 
University Hospital; and two adult cohorts: MGH and Cedars- 
Sinai Medical Center; Fig. 1a, Extended Data Table 1, see Methods). 
Individuals not diagnosed with IBD on the basis of initial endoscopic 
and histopathologic findings were classified as ‘non-IBD’ controls. We 
analysed 651 biopsies (baseline) and 529 blood samples (approximately 
quarterly), which were collected in the clinic, and 1,785 stool samples, 
which were collected every two weeks using a home shipment pro- 
tocol for one year (Fig. 1b). The latter yielded primarily microbially 
focused profiles: metagenomes (MGX), metatranscriptomes (MTX), 
proteomes (MPX), metabolomes (MBX), and viromes (VX) at several 
‘global’ time points across all subjects (Fig. 1b), as well as denser, more 
intensive sampling from individuals with more variable disease activity 
(see Methods, Extended Data Fig. la-d). We generated multiple meas- 
urement types from many individual stool specimens, including 305 
samples that yielded all stool-derived measurements, and 791 MGX- 
MTX pairs (Fig. 1c, Extended Data Fig. 1b). Biopsies yielded host- 
and microbe-targeted human RNA sequencing (RNA-seq (HTX)), 
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Fig. 1 | Multi-omics of the IBD microbiome in the IBDMDB study. 

a, Overview of cohort characteristics. We followed 132 participants (with 
CD, with UC, or without IBD (control)) for one year each. Principal 
component analysis (PCA) of SNP profiles shows that the resulting 
IBDMDB cohort is mostly of European ancestry as compared to the 1000 
Genomes (1kG) reference (see Methods). b, Sampling strategy. The study 
yielded host and microbial data from colon biopsy (baseline), blood 
(approximately quarterly), and stool (every two weeks), assessing global 
time points for all subjects and dense time courses for a subset. Raw, non- 
quality-controlled sample counts are shown. c, Overlap of multi-omic 
measurements from the same sample (strict) or from near-concordant 
time points (with differences of up to 2 or 4 weeks; see Methods). 

d, Principal coordinates analysis (PCoA) based on species-level Bray— 
Curtis dissimilarity; most variation is driven by a tradeoff between phylum 
Bacteroidetes versus Firmicutes. Samples from individuals with IBD 

(CD in particular) had weakly lower Gini-Simpson alpha diversity 


epigenetic reduced representation bisulfite sequencing (RRBS), and 
16S rRNA gene amplicon sequencing (16S), which were matched with 
human exome sequencing, serological profiles, and RRBS from blood. 
All data are available at https://ibdmdb.org/. 


Multi-omic gut microbiome changes in IBD 

Consistent with prior studies*®, although subsets of IBD (CD in par- 
ticular) contributed to the second axis of taxonomy-based principal 
coordinates (Fig. 1d, Extended Data Fig. 2a), inter-individual variation 
accounted for the majority of variance for all measurement types”™!” 
(Fig. le, f, Extended Data Fig. 2a). Even relatively large effects, such 
as disease status or physiological and technical factors, explained a 
smaller proportion of variation (Fig. 1f); this was true across measure- 
ment types, although these captured distinct aspects of IBD dysbiosis 
(see below). 

Most measurement types captured correlated changes among 
and within subjects, cross-sectionally and longitudinally (Fig. le). 
Functional profiles, measured from MGX, MTX, and MPX, were 
the most tightly coupled (Fig. le), although some individual feature- 
wise correlations were weak (Spearman's correlation MGX-MTX 
0.44 + 0.10 (mean + s.d.), MGX-—MPX 0.14 + 0.083, and MTX- 
MPX 0.18 + 0.0961; Extended Data Fig. 2b). Unexpectedly, charac- 
terized enzymes tended to be only weakly correlated with their known 
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(Wald test P = 0.26 and 0.014 for UC and CD compared with non-IBD, 
respectively). e, Mantel tests quantifying variance explained (square of 
Mantel statistic) between measurement type pairs, with differences across 
subjects (inter-individual) or within subjects over time (intra-individual; 
see Methods); results show tight coupling across measurement types. 
Sample sizes in f. f, PERMANOVA shows that inter-individual variation 

is largest for all measurement types, with even relatively large effects 

(for example, antibiotics or IBD phenotype) capturing less variation 

(see Methods). Stratified tests (CD/UC) consider only samples within the 
indicated phenotype (note that sample counts decrease for these, resulting 
in larger expected covariation by chance). Stars show FDR-corrected 
statistical significance (FDR *P < 0.05, **P < 0.01, ***P < 0.001). 
Variance is estimated for each feature independently (Methods). ‘All’ refers 
to a model with all metadata. Total n for each measurement type is shown 
in square brackets, distributed across up to 132 subjects (Extended Data 
Fig. 1a, see Methods). 


substrates or products (Supplementary Fig. 1). Although our dietary 
characterization was obtained through a very broad-level food fre- 
quency questionnaire, it provides an initial characterization of longitu- 
dinal diet-microbiome coupling in a substantial population over many 
months; diet accounted for a small but significant 3% (false discovery 
rate (FDR) P = 7.4 x 10~*) of taxonomic variation between subjects, 
and 0.7% (FDR P = 4.3 x 107“) of variation longitudinally. 

Simple cross-sectional differences between individuals with IBD and 
those without (Supplementary Tables 1-14) were most apparent in the 
metabolome (Figs. 1f, 2a, Extended Data Fig. 2a, c, d, see Methods). 
Overall, metabolite pools were less diverse in individuals with IBD, par- 
alleling previous observations for microbial diversity (Supplementary 
Table 2); this might be caused by poor nutrient absorption, greater 
water or blood content in the bowels, and shorter bowel transit times 
in individuals with active IBD1!. The smaller number of compounds 
that were more abundant in patients with IBD included polyunsatu- 
rated fatty acids such as adrenate and arachidonate. Pantothenate and 
nicotinate (vitamins B5 and B3, respectively) were particularly depleted 
in the gut during IBD; this is notable because these are not typically 
among the B vitamins that are deficient in the serum of patients with 
IBD”, although low nicotinate levels have been detected during active 
CD}. Both vitamins are required to produce cofactors used in lipid 
metabolism", and nicotinate has anti-inflammatory and anti-apoptotic 
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Fig. 2 | Metagenomic, metatranscriptomic, and stool metabolomic 
profiles are disrupted during IBD activity. a, Relative abundance 
distributions for ten of the most cross-sectionally significantly 
differentially abundant metabolites in samples from individuals with IBD, 
as a ratio to the median relative abundance in individuals without IBD 
(Wald test; all FDR P < 0.003; see Methods; Supplementary Tables 1-14). 
Left, fraction of samples below detection limit (see Methods). n = 546 
samples from 106 subjects. b, Relationships between two measures of 
disease activity: patient-reported (Harvey—Bradshaw index (HBI) in CD, 
n = 680 samples from 65 subjects; simple clinical colitis activity index 
(SCCAI) in UC, n = 429 samples from 38 subjects) and host molecular 
(faecal calprotectin (cal)**, n = 652 samples from 98 subjects). Linear 
regression shown with 95% confidence bound. ¢, d, Distribution of 
microbial dysbiosis scores as a measure of disease activity (c, median 
Bray-Curtis dissimilarity between a sample and non-IBD samples; 
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functions in the gut!*. Notably, nicotinuric acid, a metabolite of 
nicotinate!®, was found almost exclusively in the stool of patients with 
IBD. Faecal calprotectin and the Harvey—Bradshaw Index (HBI), two 
measures of disease severity in CD, showed no significant correlation, 
whereas the Simple Clinical Colitis Activity Index!” (SCCAI) in UC did 
correlate weakly with faecal calprotectin levels (Fig. 2b). 

Notably, no metagenomic species were significantly different between 
samples from individuals with IBD and those from control individ- 
uals after correction for multiple hypothesis testing (Supplementary 
Table 1), in contrast with previous work*>"'’. We hypothesized this 
was due to the differentiation of study participants into two subsets, 
one with relatively inactive IBD (due to remission or recent onset) and 
the other with greater activity. This differentiation has been observed 
in several cohorts of patients with IBD~'®, but it is more pronounced 
here because we did not take samples specifically from subjects selected 
for active disease. We therefore classified samples with taxonomic com- 
positions highly unlike those of non-IBD control samples as ‘dysbiotic 
(Fig. 2c, Extended Data Fig. 3a—e, see Methods). Dysbiotic excursions 
in this cohort did not correspond with disease location (for example, 
ileal CD; F-test P = 0.11, see Methods), and occurred longitudinally 
within subjects; they were weakly correlated with patient-reported and 
molecular measures of disease activity (Fig. 2d, Extended Data Fig. 3a). 
In total, 272 dysbiotic samples were taken during 78 full periods of 
dysbiosis and 9 censored periods (that is, subjects who were dysbiotic at 
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see Methods) and its relationship with calprotectin (d, n = 652 samples 
from 98 subjects). Linear regression with 95% confidence. e, Kaplan- 
Meier curves for the distributions of the durations of (left) and intervals 
between (right) dysbiotic episodes in UC and CD. Both are approximately 
exponential (fits in dashed lines), with means of 4.1 and 17.2 weeks, 
respectively, for UC, and 7.8 and 12.8 weeks for CD (see Methods). 

f, Relative abundance distributions of significantly different metagenomic 
species (n = 1,595 samples from 130 subjects), metabolites (n = 546 
samples from 106 subjects), and microbial transcribers (n = 818 samples 
from 106 subjects) in dysbiotic samples compared to non-dysbiotic 
samples from the same disease group (Wald test; all FDR P < 0.05; full 
results in Supplementary Tables 15-28). Also shown are antibody titres 
for ANCA, ASCA (IgG or IgA), anti-OmpC, and anti-CBir1 antibodies 

(n = 146 samples from 61 subjects). Boxplots show median and lower/ 
upper quartiles; whiskers show inner fences; sample sizes above boxes. 


the end of the time series, see Methods), or 17.1% of all samples (n = 178 
(24.3%) in CD and n = 51 (11.6%) in UC). Plots of the durations of 
and times between dysbiotic periods were approximately exponential, 
suggesting that transitions are triggered, at least in part, by events with 
constant probability over time (and are thus potentially stochastic; 
Fig. 2e). 

Using the resulting definition of dysbiosis, dysbiotic periods 
corresponded to a larger fraction of variation in all measurement types 
than did overall IBD phenotype (Fig. 1f, Supplementary Tables 15-28); 
this is likely to reflect a clearer delineation between active and less 
active disease states within extremely heterogeneous subjects over 
time. Though it is unclear which aspects of dysbiosis are causes or 
consequences of IBD, characterization of these changes will lead to 
greater understanding of microbial dynamics in disease. As in previ- 
ous cross-sectional studies of established disease’, differences between 
dysbiotic and non-dysbiotic samples from individuals with CD were 
more pronounced than in those from individuals with UC (Fig. 1). 
Notably, dysbiosis also distinguished between independent host meas- 
ures, such as individuals with high and low ASCA (anti-Saccharomyces 
cerevisiae antibodies), ANCA (anti-neutrophil cytoplasm antibodies), 
OmpC (outer membrane protein C), and CBirl (anti-flagellin) 
antibody titres in serological profiles (Fig. 2f; Fisher's combined 
probability test P = 0.00044 from Wilcoxon tests between dysbiotic 
and non-dysbiotic CD). Dysbiosis was not significantly associated 
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with demographics or medication (logistic regression with subject as 
random effect, all FDR P > 0.05). Dysbiosis recapitulated a known 
decrease in alpha diversity in active disease, but we also identified 
numerous communities with normal complexity as dysbiotic (Extended 
Data Fig. 4a). Notably, taxonomic perturbations during dysbiosis mir- 
rored those previously observed cross-sectionally in IBD®, such as the 
depletion of obligate anaerobes including Faecalibacterium prausnitzii 
and Roseburia hominis in CD and the enrichment of facultative anaer- 
obes such as E. coli (Fig. 2f, Extended Data Fig. 4b). Ruminococcus 
torques and Ruminococcus gnavus, two prominent species in IBD!, 
were also differentially abundant in dysbiotic CD and UC, respectively 
(FDR P = 0.041 and 0.0087. A smaller subset of species also increased 
significantly in transcriptional activity (mean total transcript relative 
abundance relative to genomic abundance; see Methods) as well as 
showing differences in abundance, including Clostridium hathewayi, 
Clostridium bolteae, and R. gnavus (Fig. 2f). All had significantly 
increased expression during dysbiosis (all FDR P < 0.07), and thus 
their roles in IBD may be more pronounced than suggested solely by 
their differences in genomic abundance. 

In the metabolome, SCFAs were generally reduced in dysbiosis 
(Fig. 2f). The reduction in butyrate in particular is consistent with 
the previously observed depletion of butyrate producers® such as 
E prausnitzii and R. hominis, which was also observed here (Fig. 2f). 
We also detected enrichment of the primary bile acid cholate and its 
glycine and taurine conjugates (glycocholate q = 5.2 x 107°, tauro- 
cholate q = 1.3 x 10~°) in dysbiotic samples from participants with 
CD, when compared with non-dysbiotic samples. Similarly, glycoche- 
nodeoxycholate (q = 1.1 x 107“) was also enriched. By contrast, the 
secondary bile acids lithocholate and deoxycholate (q =5 x 107’ and 
q=1.8 x 1074, respectively) were reduced in dysbiosis, suggesting 
that secondary bile-acid producing bacteria are depleted in IBD-related 
dysbiosis, or that transit time through the colon is too short for these 
compounds to be metabolized”*”!. These significant metabolomic 
differences during microbial dysbiosis, which were concordant with 
changes expected during disease, provide further evidence that the 
dysbiosis measure is specifically relevant in IBD. 

We also observed several previously undescribed biochemical dif- 
ferences during dysbiosis, such as large changes in acylcarnitine levels. 
Many acylcarnitines were significantly enriched in dysbiosis (all FDR 
P < 0.05; see Extended Data Fig. 4c), whereas levels of base metabo- 
lites were typically reduced (Fig. 2f, Extended Data Fig. 4d). Of note, 
however, arachidonoyl carnitine (C20:4 carnitine) was reduced, and 
free arachidonate, a precursor of prostaglandins involved in inflam- 
mation, was increased (Fig. 2a). Like bile acids, carnitines are microbi- 
ally modified compounds that can have competing phenotypic effects 
depending on the precise modifications: L-carnitine, for example, tends 
to be anti-inflammatory, whereas fatty acid-conjugated carnitine does 
not act uniformly on gut inflammation’. These opposing changes in 
biochemically related metabolites further suggest that the differences 
seen during dysbiosis do not stem simply from the wholesale dilution 
of stool. Numerous other metabolites were also significantly altered in 
individuals with dysbiotic IBD (117 of 548 tested known metabolites 
with FDR P < 0.05; Extended Data Fig. 4d, Supplementary Table 16), 
showing large-scale dysregulation of metabolite pools in tandem with 
host- and microbiome-specific taxonomic and molecular features 
(Fig. 2f). Finally, although we found only a single, poorly characterized 
bacteriophage to be differentially prevalent in both IBD and dysbiosis 
(notably with reduced prevalence in IBD; Supplementary Tables 3, 17), 
we note that several participants showed a spike in viral load before a 
dysbiotic period (Supplementary Fig. 2). 


Decreased gut microbiome stability in IBD 

Our dense time series for stool-derived multi-omics from many sub- 
jects enabled us to carry out in-depth longitudinal analysis, integrating 
multiple measurements of the microbiome. Each subject’s microbiome 
tended to diverge more from the baseline over time for metagenomic, 
metatranscriptomic, and metabolomic profiles (Fig. 3a; F-test power 
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law fit P< 1074; see Methods). These changes were most pronounced 
for the taxonomic profiles of individuals with CD and UC (F-test dif- 
ference in power law fits P < 10~°), where a the microbiome of an 
individual may have almost no species in common with itself at an 
earlier time point (dissimilarity of 1; Fig. 3a), consistent with previous 
observations’. Transcripts summarized within species (Extended Data 
Fig. 5a) showed similar trends (all F-test P < 8 x 10~*) to metagen- 
omic species abundances. Meanwhile, gene family transcripts (Kyoto 
Encyclopedia of Genes and Genomes (KEGG) Orthologues (KOs)), 
metabolites (Fig. 3a), and proteins (KOs, Extended Data Fig. 5a) var- 
ied much more rapidly, with essentially as much change after around 
two weeks as over longer time periods (increasing trends less or not 
significant: non-IBD, UC and CD F-test P = 0.0006, 0.001, and 0.04, 
respectively for transcripts; 0.02, 0.06, and 0.003 for metabolites; and 
0.5, 0.15, and 0.06 for proteomics). This indicates that these features 
vary rapidly in the guts of individuals with and without IBD and lack 
additional, more extreme excursions during disease. 

We further characterized large-scale temporal differences by search- 
ing for ‘shifts’ in the microbiome between consecutive time points, 
defined as Bray—Curtis dissimilarities more similar to those between 
different people than within one person (Fig. 3a, Extended Data Fig. 5b, 
see Methods). First, considering only metagenomic taxonomic profiles, 
we found 166 such shifts, with 39 in individuals without IBD (of 382 
total possible), 44 in individuals with UC (of 381), and 83 in individuals 
with CD (of 650) (Supplementary Table 29). Owing to differences in 
total observation times, the rate of shifts was only marginally higher 
in individuals with CD or UC than in non-IBD participants (2.09 and 
1.83 shifts per year, respectively, compared with 1.79), and these were 
generally confined to a subpopulation of dysbiotic individuals (Fig. 3a). 
However, the species with the greatest changes in relative abundance 
differed markedly (Fig. 3b). Shifts in individuals without IBD occurred 
primarily in individuals with high abundances of Prevotella copri, which 
underwent repeated expansion and relaxation cycles over the course of 
weeks to months (Fig. 3c). This organism is of particular interest owing 
to its behaviour as a population-scale outgroup and its enrichment 
during new-onset rheumatoid arthritis”’. The lack of shifts due to 
P copri in participants with IBD was not due to an absence of P. copri 
in these individuals or an overabundance in those without IBD (6 of 
27 non-IBD subjects had at least one time point with more than 10% 
P. copri, consistent with healthy populations!®*), Instead, the relative 
abundances that were present remained more stable in the population 
with IBD (Fig. 3c). Taxonomic shifts in participants with IBD mirrored 
earlier observations of relative reductions in obligate anaerobes and 
overgrowth of facultative anaerobes (Fig. 3b, Extended Data Fig. 5c), 
and frequently corresponded with entry into and exit from dysbiosis 
(28 and 23 shifts marked entries and exits in IBD, respectively, account- 
ing for 40% of shifts in IBD). E. coli in particular contributed to a large 
number of shifts in IBD, although there was no clear pattern in which 
species it traded abundance with (Extended Data Fig. 5c, d). 

When we define shifts in a similar manner for metabolomics profiles 
(Extended Data Fig. 5e), the rate of shifts is approximately half that seen 
for the metagenome (1.05 shifts per year in participants without IBD, 
0.99 shifts per year in UC and 1.36 shifts per year in CD), although 
these data were strongly affected by the availability of fewer metabo- 
lomics samples (Extended Data Fig. 5e). We examined differences in 
metabolite profiles between adjacent samples from the same subjects 
and found significant separation by diagnosis (Fig. 3d; PERMANOVA 
P< 107‘). These differences were largely driven by unknown com- 
pounds, emphasizing the need for further compound annotation efforts 
and follow up to determine the significance of these compounds in 
IBD. Features with the greatest differences included urobilin (which 
showed larger differences in individuals without IBD), urate (largely in 
patients with CD), and a feature with an m/z of 152.0354 and retention 
time (RT) of 4.16 min (potentially the formic acid adduct of pyridinal- 
dehyde), which accounted for differences largely specific to UC. The 
primary contributors to shifts were largely unidentified compounds 
(Extended Data Fig. 5f, Supplementary Table 30). HILp_QI22918, 
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Fig. 3 | Temporal shifts in the microbiome are more frequent and 

more extreme in IBD. a, Bray-Curtis dissimilarities within subjects as a 
function of intervening time difference, as compared to different people 

or technical replicates; calculated for metagenomic taxonomic profiles 
(species; n = 1,595 samples from 130 subjects), metabolomics (n = 546 
samples from 106 subjects), and functional profiles (KO*” gene families; 
n= 818 samples from 106 subjects). Boxplots show median and lower/ 
upper quartiles; whiskers show inner fences. Blue, least-squares power-law 
fits; orange, thresholds for microbiome shifts (see Methods). Proteomics 
and species-level transcripts in Extended Data Fig. 5a. Within-subject 
changes are significantly more extreme in UC and CD than in non-IBD for 
taxonomic profiles (F-test P = 3.9 x 10-!° and 1.2 x 10~'8, respectively) 
and transcripts (P = 0.00016 and 1.7 x 10-5), with mixed differences 

for metabolites (P = 0.012 and 0.23). Technical replicates shown (when 
possible) at 0 weeks. b, Shift frequencies for the top 10 species with greatest 
change during shifts, ranked by number of shifts as primary contributor, 


an unknown feature with m/z of 648.43067 and RT of 5.03 min, contrib- 
uted the most (ten) shifts exclusively in individuals with IBD. Among 
known compounds, methylimidazole acetic acid and urate were the 
primary contributors to the most shifts (four shifts each; Fig. 3e). 


Microbiome-associated host factors 

When we incorporated host molecular measurements, primarily from 
intestinal biopsies taken colonoscopically at baseline, into our analysis 
of the microbiome in IBD, the main influences on population varia- 
bility were strikingly different from those that affected the microbiota 
alone. In particular, tissue location was a major driver of intestinal 
epithelial gene expression (Extended Data Fig. 2c) even in the face of 
microbial variation’® (Extended Data Fig. 2d). We therefore performed 
microbiome and phenotypic association analyses independently for 
each standardized biopsy location (see Methods). 

We identified genes that were significantly differentially expressed 
(DEGs) in patient biopsies taken in inflamed locations of the ileum 
(from individuals with CD) and rectum (both CD and UC) compared 
to individuals without IBD (Extended Data Fig. 6a). This analysis 
identified 305 and 920 genes, genes that were differentially expressed 


stratified by disease phenotype(s) (full table Supplementary Table 29). 

c, P. copri is of interest in arthritis** and international populations**, and 

it alone retained stable abundances in CD but bloom-relaxation dynamics 
in controls (two-tailed Wilcoxon test of absolute differences between 
consecutive time points P = 4.2 x 10~° between non-IBD and UC, and 

1.1 x 10~* between non-IBD and CD). Plot shows 22 subjects with at 
least one time point with more than 10% differential abundance (n = 267 
samples). d, Ordination of temporally adjacent samples within individual, 
based on metabolomics (Bray—Curtis principal coordinates on normalized 
absolute abundance differences). Disease groups separate significantly 

(n = 440 sample pairs from 106 subjects; PERMANOVA R? = 2.8%, 

P < 107). Urobilin, urate, and an unidentified untargeted feature 

that segregates with disease groups in the PCoA are shown (right); 
HILn_QI1594 (HILIC-neg method m/z = 152.0354, RT = 4.16 min). 

e, As inc, but for urate (two-tailed Wilcoxon test P = 0.0012 non-IBD-UC, 
P= 0.044 non-IBD-CD; n = 546 samples from 106 subjects). 


(primarily overexpressed) in the ileum and rectum, respectively, 
for further analysis (together representing 1,008 unique genes, neg- 
ative binomial model FDR P < 0.05 and fold-change >1.5; Fig. 4a, 
Supplementary Table 31). These included genes that can affect com- 
mensal microorganisms directly, such as the antimicrobial CXCL6 
(a cell membrane disruptor”°) and SAA2 (which inhibits growth of 
Gram-negative bacteria’’), as well as indirect microbial modulators 
such as DUOX2 (which produces reactive oxygen species”*) and 
LCN2 (which induces microbial iron starvation through sequestra- 
tion”’; Fig. 4b). Enrichment analysis testing for overrepresentation of 
KEGG* pathways among DEGs also confirmed strong representation 
of immune-related pathways (one-sided hypergeometric test, FDR 
P < 0.05). In particular, the IL-17 signalling pathway, components 
of which have been previously identified in gene expression studies 
of ileal biopsies from patients with CD?!””, was enriched in upregu- 
lated DEGs in both ileum and rectum (FDR P = 2.8 x 107"; Fig. 4a, 
Supplementary Table 32). Among upregulated DEGs in rectal biopsies 
from patients with UC, we found further enrichment of the complement 
cascade (FDR P = 4.4 x 107!°), a component of innate immunity* that 
has been implicated in IBD?>***5, 
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Fig. 4 | Colonic epithelial molecular processes perturbed during 
IBD and in tandem with multi-omic host-microbe interactions. 

a, Human DEGs (negative binomial FDR P < 0.05, minimum fold 
change 1.5; Supplementary Table 31) from 81 subjects with paired ileal 
and rectal biopsies. Ordering by diagnosis, clustering within diagnosis. 
IL-17 signalling (I) showed strongest enrichment in ileal DEGs (FDR 
P= 8.2 x 10~!”)3!, while the complement cascade (II)* was enriched 
in rectal DEGs from patients with UC (FDR P = 5.2 x 10-*; KEGG*” 
gene sets, Supplementary Table 32). Example DEGs shown with I and II. 
b, Expression of four genes involved in host-microbe interactions**”’. 
Inflamed biopsy samples are shown for CD from ileum (left, n = 20, 
23, 39 independent samples for non-IBD, UC, CD respectively); for CD 
and UC in rectum (right column; n = 22, 25, 41 independent samples 
for non-IBD, UC, CD); non-IBD samples were non-inflamed. Asterisks 
indicate significant differential expression compared to non-IBD 
(Fisher's exact test, FDR P < 0.05; P values in Supplementary Table 31). 


To identify the components of the microbiome that were most asso- 
ciated with these changes, we tested for transcripts that covaried with 
the relative abundance of microorganisms measured directly from 
the same specimens using 16S amplicon sequencing. We identified 
31 and 106 significant gene-operational taxonomic unit (OTV) pairs 
in the ileum and rectum, respectively, with no overlap between the 
two sites, consistent with the different overall gene expression pat- 
terns that separate them (partial Spearman correlation FDR P < 0.05; 
see Methods, Extended Data Fig. 6b, Supplementary Table 33). The 
genes involved included known IBD-associated host-microbial interac- 
tion factors, including DUOX2 and its maturation factor DUOXA2?}*6, 
both of which were negatively associated with the abundance of 
Ruminococcaceae UCG 005 (OTU 839) in the ileum. The expression 
of several chemokine genes, some of which have reported antimicrobial 
properties*” (CXCL6, CCL20), were negatively correlated with the 
relative abundance of Eubacterium rectale (OTU 120) in the ileum, 
and Streptococcus (OTU 37) and Eikenella (OTU 339) in the rectum, 
suggesting that these species are the most susceptible to the activity of 
these chemokines. Finally, although this cohort was not designed for 
genetic association discovery (Supplementary Discussion, Extended 
Data Fig. 6c, d, Supplementary Table 34), we also provide exome 
sequencing for 92 subjects, which may be integrated with larger 
populations in the future. 
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Boxplots show median and lower/upper quartiles; whiskers show inner 
fences. c, Significant associations among 10 aspects of host-microbiome 
interactions: metagenomic species, species-level transcription ratios, 
functional profiles captured as EC gene families (MGX, MTX and 

MPX), metabolites, host transcription (rectum and ileum), serology, 

and calprotectin (sample counts in Fig. 1b, c). Network shows top 300 
significant correlations (FDR P < 0.05) between each pair of measurement 
types (for serology, FDR P < 0.25). Nodes coloured by disease group in 
which they are ‘high, edges by sign and strength of association. Spearman 
correlations use residuals of a mixed-effects model with subjects as 
random effects (or a simple linear model when only baseline samples were 
used (biopsies)) after covariate adjustment (see Methods). Time points 
approximately matched with maximum separation 4 weeks (see Methods). 
Singletons pruned for visualization (Extended Data Fig. 8). Hubs (nodes 
with at least 20 connections) emphasized. 


Dynamic, multi-omic microbiome interactions 

We next searched for host and microbial molecular interactions that 
might underlie disease activity in IBD by constructing a large-scale 
cross-measurement type association network that incorporated ten 
microbiome measurements: metagenomic species, species-level 
transcription ratios, functional profiles captured as Enzyme 
Commission (EC) gene families (MGX, MTX and MPX), metabo- 
lites, host transcription (rectum and ileum separately), serology, and 
faecal calprotectin. To identify co-variation between components of 
the microbiome above and beyond those linked strictly to inflam- 
mation and disease state, each measurement type was first residu- 
alized using the same mixed-effects model (or linear model when 
appropriate) used to determine differential abundance (‘adjusted’ 
network; see Methods). This residualization uses longitudinal meas- 
urements to minimize any inter-individual variation (including IBD 
status), as well as dysbiotic excursions as drivers of the detected 
associations, and thus highlights within-person associations over 
time. The resulting network contained 53,161 total significant edges 
(FDR P < 0.05) and 2,916 nodes spanning features from all meas- 
urement types (Supplementary Table 35). We constructed a filtered 
subnetwork for visualization from the top 300 edges (by P value) 
per measurement type in which at least one connected node was 
dysbiosis-associated (Fig. 4c). 


Representatives from the five stool-derived measurements occurred 
as hubs (defined as nodes with at least 20 connections) in this network, 
all of which were identified as differentially abundant in dysbiosis. 
Particularly connected taxonomic features (from metagenomes and 
metatranscriptomes) included the abundances of FE prausnitzii and 
unclassified clades related to Subdoligranulum’®, which are closely phy- 
logenetically related, although the only molecular features common 
to both organisms covaried with the abundances of cholesterol and 
inosine (Extended Data Fig. 7a). E prausnitzii accounted for some of 
the strongest associations overall, including the expression of numerous 
ECs that were downregulated in dysbiosis. On the other hand, E. coli 
(and to lesser extent Haemophilus parainfluenzae) accounted for a large 
fraction of upregulated ECs. Members of the Roseburia genus were 
also associated, metatranscriptionally as well as metagenomically, with 
bile acids and a number of acylcarnitines, suggesting that Roseburia 
(together with Subdoligranulum) are involved in the carnitine and bile 
acid dysregulation observed in IBD. 

Acylcarnitines and bile acids as overall chemical classes featured 
prominently in the network, related in part to their changes during 
dysbiosis. Acylcarnitines were associated with numerous dysbiosis- 
associated species including R. hominis (nine acylcarnitines, FDR 
P < 0.05; Supplementary Table 35), Klebsiella pneumoniae (three), and 
H. parainfluenzae (three), as well as expression of C. bolteae (three), 
suggesting that multiple scales of regulation, including long-term 
growth-based and short-term transcriptional, are involved. Particularly 
notable biochemical hubs in the network included C8 carnitine, another 
acylcarnitine that was significantly increased in dysbiotic CD, cholate, 
chenodeoxycholate, and taurochenodeoxycholate, which together 
accounted for 107 edges (6%; Fig. 4c). Other prominent metabolite 
associations included several long-chain lipid hubs and the SCFA pro- 
pionate; antibodies against OmpC were strongly associated with these, 
as well as with the metagenomic abundances of the numerous ECs 
involved in the system's biosynthesis or as interactors. Calprotectin, as 
the sole feature in its own measurement type, was weakly associated with 
a number of metabolites that were not differentially abundant in dysbi- 
osis, as well as with the metagenomic abundance of several dysbiosis- 
associated ECs. Three host genes appeared in this high-significance 
subnetwork: ileal expression of GIP, NXPE4, and ANXA10. Expression 
of RNA polymerase was also a prominent node in the network, though 
not a hub, that was upregulated in dysbiosis (Extended Data Fig. 8). The 
regulation of this essential enzyme class is growth-rate-dependent”’, 
suggesting that microbial communities as a whole are more often in 
higher growth conditions in dysbiotic IBD. 

Finally, we also identified associations among features in the micro- 
biome that took dysbiosis into account, resulting in a second network 
using the same methodology but without adjusting for dysbiosis 
(‘unadjusted’; Supplementary Discussion, Extended Data Figs. 7b, 9, 
Supplementary Table 36). Together, these networks contextualize the 
multiple types of microbiome disruption that are observed in IBD, with 
associations among many molecular feature types that represent poten- 
tial targets for follow-up studies on the mechanisms that underlie IBD 
and gastrointestinal inflammation. 


Conclusions 

As part of the HMP2, we have developed the IBDMDB, one of the first 
integrated studies of multiple molecular features of the gut microbiome 
that have been implicated in IBD dynamics. While overall population 
structure was comparable among measurements of the microbiome— 
metagenomic, metatranscriptomic, metabolomic, and others—each 
measurement identified complementary molecular components 
of longitudinal dysbioses in CD and UC. Some, such as taxonomic 
shifts in favour of aerotolerant, pro-inflammatory clades, have been 
captured by previous studies; others, such as greater gene expression 
by clostridia during disease, were discovered by the use of new meas- 
urements (metatranscriptomes). The temporal stability of multiple 
microbiome measurements likewise differed across IBD phenotypes 
and disease activity, with distinct effects on molecular components of 
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the microbiome (including unexpected stability of the relative abun- 
dance of P. copri in individuals with IBD). Our data provide a catalogue 
of new relationships between multi-omic features identified as poten- 
tially central during IBD, in addition to data, protocols, and relevant 
bioinformatic approaches to enable future research. 

By leveraging a multi-omic view on the microbiome, our results 
single out a number of host and microbial features for follow-up 
characterization. An unclassified Subdoligranulum species, recently 
shown to forma complex of new species-level clades*®, was both markedly 
reduced in IBD and central to the functional network, associating with 
a wide range of IBD-linked metabolites both identifiable (for example, 
bile acids and polyunsaturated fatty acids) and unidentifiable. The clade 
is likely to contain at least seven species that are closely related to the 
Subdoligranulum, Gemmiger, and Faecalibacterium genera, typically 
butyrate producers that are considered to be beneficial, particularly 
in IBD*°. Therefore, the isolation and characterization of additional 
species—especially in tandem with these associated metabolites—is 
likely to reveal these clades’ physiological and immunological inter- 
actions and the consequences of their depletion in IBD. More gen- 
erally, strain-level profiling of implicated microorganisms remains to 
be carried out, particularly in direct association with host epithelium 
and corresponding molecular changes. This profiling is feasible with 
existing data from this study, and will serve to pinpoint the specific 
organisms responsible for IBD-associated accumulation of primary 
unconjugated bile acids and depletion of secondary bile acids*’. Only 
very few, low-abundance species are currently known to be capable of 
secondary bile acid metabolism’, and expanding the range of strains 
known to carry appropriate metabolic cassettes will indicate poten- 
tial new targets for therapeutic restoration. Beyond short-chain fatty 
acids and bile acids, the large-scale acylcarnitine dysbiosis observed 
here may also provide a promising new target for IBD, particularly 
after determining whether this shift in metabolite pools is host- or 
microbiome-driven. 

We stress that it has not yet been determined whether these multi- 
omic features of the microbiome can predict disease events before 
their occurrence and that the disease-relevant time scales of distinct 
molecular events have not been identified (for example, static host 
genetics, relatively slow epigenetics or microbial growth, rapid host 
and microbial transcriptional changes). It may also be fruitful to seek 
out the earliest departures from a subject-specific baseline state that, 
while themselves still ‘eubiotic, may predict the subsequent onset of 
dysbiosis or disease symptoms. Some such characterization may be 
possible in data from this study, although other causal analysis may 
be better carried out at finer-grained time scales or using interven- 
tional study designs. It will be most important to take these molecular 
results back to the clinic, in the form of better predictive biomarkers 
of IBD progression and outcome, and as a set of new host-microbe 
interaction targets for which treatments to ameliorate the disease may 
be developed. 


Online content 

Any methods, additional references, Nature Research reporting summaries, source 
data, statements of data availability and associated accession codes are available at 
https://doi.org/10. 1038/s41586-019-1237-9. 
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METHODS 

Recruitment and specimen collection. Recruitment. Five medical centres partici- 
pated in the IBDMDB: Cincinnati Children’s Hospital, Emory University Hospital, 
Massachusetts General Hospital, Massachusetts General Hospital for Children, and 
Cedars-Sinai Medical Center. Patients were approached for potential recruitment 
upon presentation for routine age-related colorectal cancer screening, work up 
of other gastrointestinal (GI) symptoms, or suspected IBD, either with positive 
imaging (for example, colonic wall thickening or ileal inflammation) or symp- 
toms of chronic diarrhoea or rectal bleeding. Participants could not have had a 
prior screening or diagnostic colonoscopy. Potential participants were excluded 
if they were unable to or did not consent to provide tissue, blood, or stool, were 
pregnant, had a known bleeding disorder or an acute gastrointestinal infection, 
were actively being treated for a malignancy with chemotherapy, were diagnosed 
with indeterminate colitis, or had undergone a prior, major gastrointestinal surgery 
such as an ileal/colonic diversion or j-pouch. Upon enrolment, an initial colonos- 
copy was performed to determine study strata. Subjects not diagnosed with IBD 
based on endoscopic and histopathologic findings were classified as ‘non-IBD’ 
controls, including the aforementioned healthy individuals presenting for routine 
screening, and those with more benign or non-specific symptoms. This creates a 
control group that, while not completely ‘healthy’ differs from the IBD cohorts 
specifically by clinical IBD status. Differences observed between these groups are 
therefore more likely to constitute differences specific to IBD, and not differences 
attributable to general GI distress. In total, 132 subjects took part in the study 
(Extended Data Table 1). 

Regulatory compliance. The study was reviewed by the Institutional Review Boards 
at each sampling site: overall Partners Data Coordination (IRB #2013P002215); 
MGH Adult cohort (IRB #2004P001067); MGH Paediatrics (IRB #2014P001115); 
Emory (IRB #IRB00071468); Cincinnati Children’s Hospital Medical Center (2013- 
7586); and Cedars-Sinai Medical Center (3358/CR00011696). All study partici- 
pants gave written informed consent before providing samples. Each IRB has a 
federal wide assurance and follows the regulations established at 45 CFR Part 46. 
The study was conducted in accordance with the ethical principles expressed in 
the Declaration of Helsinki and the requirements of applicable federal regulations. 
Specimen collection and storage. Specimens for research (biopsies, blood draws, 
and stool samples) were collected during the screening colonoscopy, at up to five 
quarterly follow-up visits at the clinic (termed ‘baseline’ visit 2, and so on, occur- 
ring at months 0, 3, 6, 9, and 12), and every two weeks by mail. 

Biopsies. Biopsies were primarily gathered during the initial screening colonoscopy, 
where approximately four to fourteen biopsies were collected for each subject. For 
each location sampled (at least ileum and 10 cm from rectum, plus discretionary 
sites of inflammation), one biopsy was collected for standard histopathology at 
the sampling institution, two biopsies were collected and stored in RNAlater for 
molecular data generation (host and microbial, stored at -20°C), and one biopsy 
was collected and placed in a sterile tube with 5% glycerol (stored at -80°C). If 
possible, additional biopsies from inflamed tissue and nearby non-inflamed tissue 
were taken from participants with CD or UC. For adults, a second set of biopsies 
was also collected from each location (rectum and ileum) for epithelial cell cul- 
ture (for detailed protocols see http://ibdmdb.org/protocols). All biopsies were 
stored for up to two months at the collection site, and shipped overnight on dry 
ice to Washington University for epithelial cell culture or to the Broad Institute 
for molecular profiling. 

Blood samples. Blood samples (whole blood and serum) were taken at the quarterly 
clinical visits. For whole blood, 1 ml of blood was collected and stored at -80°C. 
For serum, blood was drawn into a 5-ml SST tube, and left at room temperature 
for 40 min. This was centrifuged for 15 min at 3,000 r.p.m. and 0.5-ml portions 
were immediately aliquoted into 2-m]l microtubes. Tubes were stored at -80°C. 
Stool samples. Stool specimens were collected both at the clinical visits and every 
two weeks by mail using a home collection kit developed for the project (http:// 
ibdmdb.org/protocols) and previously validated**. Participants first deposited 
stool into a collection bowl suspended over a commode. They then collected two 
aliquots using a scoop to transfer stool into two Sarstedt 80.623 tubes: one with 
approximately 5 ml molecular biology grade 100% ethanol, and one with no pre- 
servative. Stool samples were then sent from each participant by FedEx to the 
Broad Institute where they were processed immediately before storage at -80°C. 
The ethanol tube was centrifuged to pellet stool, which was subaliquotted, and the 
supernatant was transferred to a new tube for metabolomic analysis. Stool from 
ethanol was aliquoted into 2-ml cryovials in ~100-200-mg aliquots, prioritizing 
specimens for metaomic sequencing, metabolomics, and viromics in that order. 
Any remaining stool was stored in additional aliquot tubes. One hundred mil- 
ligrams of the non-ethanol stool was stored for assaying faecal calprotectin and 
the remainder was saved in a second tube. All samples were stored at -80°C after 
receipt before processing. This home-collection method was shown previously to 
produce reproducible results compared to flash-frozen samples“, consistent with 
previous observations across data types*”-*”. Note that an accurate estimate of the 
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stool water content could not be obtained, as samples were collected by subjects 
and preserved in ethanol at room temperature until aliquots were generated for 
the different data generation platforms. 

Participant and sample metadata. Descriptions of each participant and specimen 
were captured at baseline and accompanying each specimen collection, respec- 
tively. At baseline (that is, during or before the screening colonoscopy), subjects 
completed a Reported Symptoms Questionnaire, the Short Inflammatory Bowel 
Disease Questionnaire”, a Food Frequency Questionnaire, and an Environmental 
Questionnaire, and the Simple Endoscopic Score*! for CD subjects or Baron's 
Score™ for UC subjects was assessed. 

During both follow-up visits and paired with mailed stool samples, subjects 

completed an Activity Index and Dietary Recall Questionnaire to assess their 
disease activity index (HBI for CD or SCCAI for UC) and provide a retrospec- 
tive recall of their recent diet. All questionnaires, as well as detailed protocols 
(including product numbers), can be found on the IBDMDB data portal at http:// 
ibdmdb.org/protocols. Responses and metadata are available at http://ibdmdb. 
org/results, and summaries of phenotypes for samples and subjects are provided 
(Supplementary Fig. 3) along with summaries of the final time series for each 
subject (Supplementary Fig. 2). 
Stool specimen processing. Sample selection. Sample selection proceeded in two 
phases, with an initial round of data generation producing a pilot metagenomics 
and metatranscriptomics data set, which was analysed separately**. This pilot sam- 
ple selection included at least one sample per participant that was enrolled in the 
study at that time, two long time courses per disease group (CD, UC, non-IBD), 
and multiple shorter time courses, resulting in 300 samples. For a subset of 78 
samples, metatranscriptomic data were generated. Samples were chosen on the 
basis of sample mass, preferentially selecting samples that could be re-sequenced 
if needed during the later data generation. 

For the second, larger phase of data generation, stool samples were selected for 
different assays with the goal of generating data covering as many aspects of the 
cohort as possible, including per-subject time courses, cross-subject global time 
points, and samples from all patients, phenotypes, age ranges, clinical centres, and 
so forth (Fig. 1b). The subset of measurements performed for each sample was 
determined in large part by aliquot requirements (in particular, mass requirements 
for the assay relative to how much the patient provided) and cost. 

For proteomics and metabolomics, six global time points were equally distrib- 
uted over the year-long time series for as many subjects as possible. Restrictions 
such as available sample mass and missing samples were incorporated by select- 
ing the nearest suitable sample in time, resulting in slight irregularities in the 
sampling pattern. In total, 546 metabolite profiles and 450 proteomics profiles 
were generated. From among these samples, 768 were selected for metagenom- 
ics, metatranscriptomics, and viromics, corresponding to 8 plates of 96 samples 
each. Samples already selected for proteomics or metabolomics were prioritised 
to facilitate integrated data analysis (316 samples had sufficient mass), resulting in 
six global time points for all subjects. In cases where the respective sample was not 
available for a subject, the nearest suitable sample in time was selected. Subjects 
with greater fluctuations in their HBI or SCCAI scores were then prioritized for 
denser sampling, resulting in 12 long time courses for 5 participants with CD, 4 
with UC, and 3 without IBD. The selection also included 23 technical replicates 
for metagenomics, metatranscriptomics and viromics. 

Finally, 576 additional samples were selected specifically for metagenomic 
sequencing (6 plates) resulting in a total of 1,344 metagenomic samples. Samples 
at previously selected global time points and long time courses that had been 
restricted by available mass for other measurement types were prioritized. An 
additional four global time points were added by this process, as well as 15 long 
time courses (representing 10 participants with CD, 10 with UC, and 7 without 
IBD), and 22 samples that had been previously sequenced for the pilot data and 
represented additional technical replicates. Lastly, 522 samples were selected for 
faecal calprotectin measurements, prioritizing samples that were selected for any 
other multi-omics data generation and representing a broad overview of the cohort. 
Ofa total of 2,653 collected stool samples, 1,785 generated at least one measure- 
ment type (Fig. 1b). 

Sample selection for RNA-seq and 16S sequencing from biopsies, and host gen- 
otyping from blood draws, aimed to cover the 95 subjects who contributed at least 
14 stool samples, as permitted by the availability of biopsies and blood draws for 
each assay. Sample selection from biopsies additionally aimed to cover biopsies 
from inflamed and non-inflamed sites. In total, 254 biopsies were selected for 
RNA-seq, covering 43 participants with CD, 25 with UC, and 22 without IBD, and 
distributed across biopsy sites and inflammation statuses (Extended Data Fig. 6A); 
and 161 biopsies were selected for 16S sequencing, covering 36 participants with 
CD, 21 with UC, and 22 without IBD. Exome sequencing was performed for 46 
participants with CD, 24 with UC, and 22 without IBD. 

Sample selection for remaining sample types (RRBS, blood serology) included 
all samples with a suitable sample available. 
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Sequencing assays. DNA and RNA isolation for metagenomics and metatranscrip- 
tomics. Total nucleic acid was extracted from one aliquot of each assayed stool 
sample via the Chemagic MSM I with the Chemagic DNA Blood Kit-96 from 
Perkin Elmer. This kit combines chemical and mechanical lysis with magnetic 
bead-based purification. Prior to extraction on the MSM-I, TE buffer, lysozyme, 
proteinase K, and RLT buffer with beta-mercaptoethanol were added to each stool 
sample. The stool lysate solution was vortexed to mix. 

Samples were then placed on the MSM I unit to automate the following steps: 
M-PVA magnetic beads were added to the stool lysate solution and vortexed to mix. 
The bead-bound total nucleic acid was then removed from solution using a 96-rod 
magnetic head and washed in three ethanol-based wash buffers. The beads were 
then washed in a final water wash buffer. Finally, the beads were dipped in elution 
buffer to resuspend the DNA sample in solution. The beads were then removed 
from solution, leaving purified total nucleic acid eluate. The eluate was then split 
into two equal volumes: one for DNA and the other for RNA. SUPERase-IN 
solution was added to the DNA samples, and the reaction was cleaned up using 
AMPure XP SPRI beads. DNase was added to the RNA samples, and the reaction 
was cleaned up using AMPure XP SPRI beads. 

DNA samples were quantified using a fluorescence-based PicoGreen assay. RNA 

samples were quantified using a fluorescence-based RiboGreen assay. RNA quality 
was assessed via smear analysis on the Caliper LabChip GX. 
Metagenome sequencing. Metagenomes were generated from the resulting DNA for 
1,638 stool samples, selected to obtain both a broad overview of targeted, aligned 
time points for all subjects (Fig. 1b), complemented by a dense sampling of sub- 
jects which tended to have greater disease activity, as determined by their HBI or 
SCCAI scores. 

Whole-genome fragment libraries were prepared as follows. Metagenomic DNA 
samples were quantified by Quant-iT PicoGreen dsDNA Assay (Life Technologies) 
and normalized to a concentration of 50 pg/ul. Illumina sequencing libraries were 
prepared from 100-250 pg DNA using the Nextera XT DNA Library Preparation 
kit (Illumina) according to the manufacturer’s recommended protocol, with reac- 
tion volumes scaled accordingly. Prior to sequencing, libraries were pooled by 
collecting equal volumes (200 nl) of each library from batches of 96 samples. Insert 
sizes and concentrations for each pooled library were determined using an Agilent 
Bioanalyzer DNA 1000 kit (Agilent Technologies). Libraries were sequenced on 
HiSeq2000 or 2500 2x101 to yield ~10 million paired end reads. Post-sequencing 
de-multiplexing and generation of BAM and FASTQ files were generated using 
the Picard suite (https://broadinstitute.github.io/picard). 

Metatranscriptome sequencing. Metatranscriptomes were generated for 855 stool 
samples, subsampled from metagenomic selections as above. Illumina cDNA 
libraries were generated using a modified version of the RNAtag-seq protocol”. 
In brief, 500 ng-1 j1g of total RNA was fragmented, depleted of genomic DNA, 
dephosphorylated, and ligated to DNA adapters carrying 5’-AN8-3’ barcodes of 
known sequence with a 5’ phosphate and a 3’ blocking group. Barcoded RNAs were 
pooled and depleted of rRNA using the RiboZero rRNA depletion kit (Epicentre). 
Pools of barcoded RNAs were converted to Illumina cDNA libraries in two main 
steps: (i) reverse transcription of the RNA using a primer designed to the constant 
region of the barcoded adaptor with addition of an adaptor to the 3’ end of the 
cDNA by template switching using SMARTScribe (Clontech) as described™; 
(ii) PCR amplification using primers whose 5’ ends target the constant regions 
of the 3’ or 5’ adaptors and whose 3’ ends contain the full Illumina P5 or P7 
sequences. cDNA libraries were sequenced on the Illumina HiSeq2500 platform 
to generate ~13 million paired end reads. 

Viromics. We selected 703 stool samples for viral profiling, following the sample 
selection used for metatranscriptomics and adjusted slightly only when aliquots 
were unavailable (Fig. 1c). Viral nucleic acids were extracted using the MagMax 
Viral RNA Isolation Kit (AM1939, Thermo Fisher Scientific). Viral RNA was 
reverse transcribed using SuperScript II RT (18064014, Thermo Fisher) and 
random hexamers. After short molecule and random hexamer removal using 
ChargeSwitch (CS12000, Thermo Fisher), molecules were amplified and tagged 
with a BC12-V8A2 construct® using AccuPrimeTM Taq polymerase and cleaned 
with ChargeSwitch kit. 

The resulting viral amplicons were normalized, pooled, and made into an 
Illumina library without shearing. The library (150-600 bp) was loaded into an 
Illumina HiSeq 2000 (Illumina, Carlsbad, CA) and sequenced using the 2 x 100 bp 
chemistry. Reads were demultiplexed into a sample bin using the barcode pre- 
fixing read-1 and read-2, allowing zero mismatches. Demultiplexed reads were 
further processed by trimming off barcodes, semi-random primer sequences, and 
Illumina adapters. This process used a custom demultiplexer and the BBDuk algo- 
rithm included in BBMap (http://sourceforge.net/projects/bbmap). The resulting 
trimmed data set was analysed using a pipeline created at the Alkek Center for 
Metagenomics and Microbiome Research at Baylor College of Medicine’”. In brief, 
the viral analysis pipeline uses a clustering algorithm creates putative viral genomes 
using a mapping assembly strategy that leverages nucleotide and translated 


nucleotide alignment information. Viral taxonomies were assigned using a scoring 
system that incorporates nucleotide and translated nucleotide alignment results 
in a per-base fashion and optimizes for the highest resolution taxonomic rank. 
Metabolomics. Sample selection, receipt, and storage. Sample selection for metab- 
olomics aimed to obtain only a broad sampling of many subjects. In total, 546 
stool samples were selected for profiling (Fig. 1b). A portion of each selected stool 
sample (40-100 mg) and the entire volume of originating ethanol preservative 
were stored in 15-ml centrifuge tubes at -80°C until all samples were collected. 
Sample processing. Samples were thawed on ice and then centrifuged (4°C, 5,000g) 
for 5 min. Ethanol was evaporated using a gentle stream of nitrogen gas using a 
nitrogen evaporator (Turbo Vap LV; Biotage) and stored at -80°C until all samples 
in the study had been dried. Aqueous homogenates were generated by sonicat- 
ing each sample in 900 jl H2O using an ultrasonic probe homogenizer (Branson 
Sonifier 250) set to a duty cycle of 25% and output control of 2 for 3 min. Samples 
were kept on ice during the homogenization process. The homogenate for each 
sample was aliquoted into two 10-1 and two 30-11 samples in 1.5-ml centrifuge 
tubes for LC-MS sample preparation and 30 \1l of homogenate from each sample 
was transferred into a 50-ml conical tube on ice to create a pooled reference sample. 
The pooled reference mixture was mixed by vortexing and then aliquoted (100 j1l 
per aliquot) into 1.5-ml centrifuge tubes. Aliquots and reference sample aliquots 
were stored at -80°C until LC-MS analyses were conducted. 

LC-MS analyses. A combination of four LC-MS methods were used to profile 
metabolites in the faecal homogenates, as previously published**; two methods that 
measure polar metabolites, a method that measures metabolites of intermediate 
polarity (for example, fatty acids and bile acids), and a lipid profiling method. For 
the analysis queue in each method, subjects were randomized and longitudinal 
samples from each subject were randomized and analysed as a group. Additionally, 
pairs of pooled reference samples were inserted into the queue at intervals of 
approximately 20 samples for quality control and data standardization. Samples 
were prepared for each method using extraction procedures that are matched for 
use with the chromatography conditions. Data were acquired using LC-MS sys- 
tems comprised of Nexera X2 U-HPLC systems (Shimadzu Scientific Instruments) 
coupled to Q Exactive/Exactive Plus orbitrap mass spectrometers (Thermo Fisher 
Scientific). The method details are summarized below. 

LC-MS Method 1: HILIC-pos (positive ion mode MS analyses of polar metab- 
olites). LC-MS samples were prepared from stool homogenates (10 1l) by protein 
precipitation with the addition of nine volumes of 74.9:24.9:0.2 v/v/v acetoni- 
trile/methanol/formic acid containing stable isotope-labelled internal standards 
(valine-d8, Isotec; and phenylalanine-d8, Cambridge Isotope Laboratories). The 
samples were centrifuged (10 min, 9,000g, 4°C), and the supernatants injected 
directly onto a 150 x 2-mm Atlantis HILIC column (Waters). The column was 
eluted isocratically at a flow rate of 250 j1l/min with 5% mobile phase A (10 mM 
ammonium formate and 0.1% formic acid in water) for 1 min followed by a linear 
gradient to 40% mobile phase B (acetonitrile with 0.1% formic acid) over 10 min. 
MS analyses were carried out using electrospray ionization in the positive ion 
mode using full scan analysis over m/z 70-800 at 70,000 resolution and 3-Hz data 
acquisition rate. Additional MS settings are: ion spray voltage, 3.5 kV; capillary 
temperature, 350°C; probe heater temperature, 300°C; sheath gas, 40; auxiliary 
gas, 15; and S-lens RF level 40. 

LC-MS Method 2: HILIC-neg (negative ion mode MS analysis of polar metab- 
olites). LC-MS samples were prepared from stool homogenates (30 11) by pro- 
tein precipitation with the addition of four volumes of 80% methanol containing 
inosine-15N4, thymine-d4 and glycocholate-d4 internal standards (Cambridge 
Isotope Laboratories). The samples were centrifuged (10 min, 9,000g, 4°C) and 
the supernatants were injected directly onto a 150 x 2.0-mm Luna NH2 column 
(Phenomenex). The column was eluted at a flow rate of 400 l/min with initial 
conditions of 10% mobile phase A (20 mM ammonium acetate and 20 mM ammo- 
nium hydroxide in water) and 90% mobile phase B (10 mM ammonium hydroxide 
in 75:25 v/v acetonitrile/methanol) followed by a 10-min linear gradient to 100% 
mobile phase A. MS analyses were carried out using electrospray ionization in the 
negative ion mode using full scan analysis over m/z 60-750 at 70,000 resolution 
and 3 Hz data acquisition rate. Additional MS settings are: ion spray voltage, 
-3.0 kV; capillary temperature, 350°C; probe heater temperature, 325°C; sheath 
gas, 55; auxiliary gas, 10; and S-lens RF level 40. 

LC-MS Method 3: C18-neg (negative ion mode analysis of metabolites of inter- 
mediate polarity; for example, bile acids and free fatty acids). Stool homogenates 
(30 pl) were extracted using 90 j1l methanol containing PGE2-d4 as an internal 
standard (Cayman Chemical Co.) and centrifuged (10 min, 9,000g, 4°C). The 
supernatants (10 jl) were injected onto a 150 x 2.1-mm ACQUITY BEH C18 col- 
umn (Waters). The column was eluted isocratically at a flow rate of 450 j1l/min with 
20% mobile phase A (0.01% formic acid in water) for 3 min followed by a linear 
gradient to 100% mobile phase B (0.01% acetic acid in acetonitrile) over 12 min. 
MS analyses were carried out using electrospray ionization in the negative ion 
mode using full scan analysis over m/z 70-850 at 70,000 resolution and 3 Hz data 


acquisition rate. Additional MS settings are: ion spray voltage, -3.5 kV; capillary 
temperature, 320°C; probe heater temperature, 300°C; sheath gas, 45; auxiliary 
gas, 10; and S-lens RF level 60. 

LC-MS Method 4: C8-pos. Lipids (polar and nonpolar) were extracted from 
stool homogenates (10 11) using 190 1l isopropanol containing 1-dodecanoyl-2- 
tridecanoyl-sn-glycero-3-phosphocholine as an internal standard (Avanti Polar 
Lipids; Alabaster, AL). After centrifugation (10 min, 9,000g, ambient temperature), 
supernatants (10 jl) were injected directly onto a 100 x 2.1-mm ACQUITY BEH 
C8 column (1.7 jum; Waters). The column was eluted at a flow rate of 450 l/min 
isocratically for 1 min at 80% mobile phase A (95:5:0.1 v/v/vl 10 mM ammonium 
acetate/methanol/acetic acid), followed by a linear gradient to 80% mobile phase 
B (99.9:0.1 v/v methanol/acetic acid) over 2 min, a linear gradient to 100% mobile 
phase B over 7 min, and then 3 min at 100% mobile phase B. MS analyses were 
carried out using electrospray ionization in the positive ion mode using full scan 
analysis over m/z 200-1,100 at 70,000 resolution and 3 Hz data acquisition rate. 
Additional MS settings are: ion spray voltage, 3.0 kV; capillary temperature, 300°C; 
probe heater temperature, 300°C; sheath gas, 50; auxiliary gas, 15; and S-lens RF 
level 60. 

Metabolomics data processing. Raw LC-MS data were acquired to the data acquisi- 
tion computer interfaced to each LC-MS system and then stored on a robust and 
redundant file storage system (Isilon Systems) accessed via the internal network at 
the Broad Institute. Nontargeted data were processed using Progenesis QlIsoftware 
(v 2.0, Nonlinear Dynamics) to detect and de-isotope peaks, perform chromato- 
graphic retention time alignment, and integrate peak areas. Peaks of unknown ID 
were tracked by method, m/z and retention time. Identification of nontargeted 
metabolite LC-MS peaks was conducted by: i) matching measured retention 
times and masses to mixtures of reference metabolites analysed in each batch; and 
ii) matching an internal database of >600 compounds that have been characterized 
using the Broad Institute methods. Temporal drift was monitored and normalized 
with the intensities of features measured in the pooled reference samples. 
Proteomics. Sample selection and LC-MS/MS. Sample selection for proteomics 
largely followed sample selection for metabolomics (Fig. 1b, c), with slight adjust- 
ments when aliquots were unavailable. In total, 447 stool samples were targeted for 
profiling. From the selected samples, proteins were proteolytically digested using 
trypsin, and each digest was subjected to automated offline high-pH reversed- 
phase fractionation with fraction concatenation. LC-MS/MS analysis for each 
fraction was performed using a Thermo Scientific Q-Exactive Orbitrap mass 
spectrometer at UCLA, outfitted with a custom-built nano-ESI interface. Samples 
were loaded onto an in-house packed capillary LC column (70 cm x 75 pm, 3-1m 
particle size), and data were acquired for 120 min. Precursor MS spectra were 
collected over 400-2,000 m/z, followed by data-dependent MS/MS spectra of the 
twelve most abundant ions, using a collision energy of 30%. A dynamic exclusion 
time of 30 s was used to discriminate against previously analysed ions. 

Peptide identification and protein data roll-up. Mass spectra from the resulting 
analyses were evaluated using the MSGF* software” v10072 using the HMP 1 gut 
reference genomes (HMP_Refgenome-gut_2015-06-18). In brief, after conversion 
of the metagenomic assemblies into predicted open reading frames (for example, 
predicted proteins), libraries were created using the forward and reverse direction 
to allow determination of FDR. The reverse decoy database allows measurement 
of the rate of detection of false hits, which in turn allows calculation of FDR and 
appropriate filtering of the data to maximize real peptide identifications while 
minimizing spurious ones. MSGF* was then used to search the experimental mass 
spectra data against both the forward and reverse decoy databases. Cut-offs for data 
included: MSGF* spectra probability (>1 x 10!°, equivalent to a BLAST e value), 
mass accuracy (+ 20 p.p.m.), protein level FDR of 1% and one unique peptide per 
protein identification. 

Faecal calprotectin. Faecal calprotectin was quantified for 652 stool samples, 
which were stored at -80°C without preservative before processing. Sample selec- 
tion focused on obtaining a broad survey of all subjects rather than detailed time 
series (Fig. 1b). Calprotectin was quantified using QUANTA Lite Calprotectin 
ELISA (Inova Diagnostics 704770) following the manufacturer's protocol. Between 
80 and 120 mg of stool was used for input. Incubation time before stopping the 
reaction was adjusted to obtain OD4o5 values in the suggested range for assay. 
Biopsy specimen processing. Co-isolation of DNA and RNA from frozen tissue. 
DNA and RNA were extracted from RNA-later-preserved biopsies using the 
AllPrep DNA/RNA Universal Kit from Qiagen. Biological samples were cut into 
20-25-mg pieces on a dry ice batch, then placed in tubes with a steel bead for 
mechanical homogenization and a highly denaturing guanidine isothiocyanate- 
containing buffer, which immediately inactivates DNases and RNases to ensure 
isolation of intact DNA and RNA. After homogenization, the lysate was passed 
through an AllPrep DNA Mini spin column. This column, in combination with 
the high-salt buffer, allows selective and efficient binding of genomic DNA. 
On-column proteinase K digestion in optimized buffer conditions allows puri- 
fication of high DNA yields from all sample types. The column was then washed 
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and DNA was eluted in TE buffer. Flow-through from the AllPrep DNA Mini spin 
column was digested by proteinase K in the presence of ethanol. This optimized 
digestion, together with the subsequent addition of further ethanol, allowed for 
appropriate binding of total RNA, including miRNA, to the RNeasy Mini spin col- 
umn. Samples were then digested with DNase I to ensure high-yields of DNA-free 
RNA. Contaminants were efficiently washed away and RNA was eluted in water. 
16S rRNA gene profiling. We selected 178 biopsies for 16S amplicon-based taxo- 
nomic profiling. The 16S rRNA gene-sequencing protocol was adapted from the 
Earth Microbiome Project® and the Human Microbiome Project®!®. In brief, bac- 
terial genomic DNA was extracted from the total mass of the biopsied specimens 
using the MoBIO PowerLyzer Tissue and Cells DNA isolation kit and sterile spat- 
ulas for tissue transfer. The 16S rDNA V4 region was amplified from the extracted 
DNA by PCR and sequenced in the MiSeq platform (Illumina) using the 2 x 250 bp 
paired-end protocol, yielding pair-end reads that overlapped almost completely. 
The primers used contained adapters for MiSeq sequencing and single-index bar- 
codes such that PCR products may be pooled and sequenced directly*!, targeting 
at least 10,000 reads per sample. 

Read pairs were demultiplexed and merged using USEARCH v7.0.1090%. 
Sequences were clustered into OTUs at a similarity threshold of 97% using the 
UPARSE algorithm®. OTUs were subsequently mapped to a subset of the SILVA 
database® containing only sequences from the V4 region of the 16S rRNA gene to 
determine taxonomies. Abundances were then recovered by mapping the demul- 
tiplexed reads to the UPARSE OTUs, producing the final taxonomic profiles. The 
150 samples with >1,000 mapped reads were used in downstream analyses. 

Host RNA-seq. cDNA library construction. In total, 252 biopsies were selected for 
transcriptional profiling. Total RNA was quantified using the Quant-iT RiboGreen 
RNA Assay Kit and normalized to 5 ng/il. Following plating, 2 jl of ERCC controls 
(using a 1:1,000 dilution) were spiked into each sample. An aliquot of 200 ng for 
each sample was transferred into library preparation, which was an automated var- 
iant of the Illumina TruSeq Stranded mRNA Sample Preparation Kit. This method 
preserves strand orientation of the RNA transcript. It uses oligo dT beads to select 
mRNA from the total RNA sample. It is followed by heat fragmentation and cDNA 
synthesis from the RNA template. The resultant 500-bp cDNA then goes through 
library preparation (end repair, base ‘A addition, adaptor ligation, and enrichment) 
using Broad Institute designed indexed adapters substituted in for multiplexing. 
After enrichment the libraries were quantified using Quant-iT PicoGreen (1:200 
dilution). After normalizing samples to 5 ng/t1l, the set was pooled and quantified 
using the KAPA Library Quantification Kit for Illumina Sequencing Platforms. 
The entire process is in 96-well format and all pipetting is done by either Agilent 
Bravo or Hamilton Starlet. 

Illumina sequencing. Pooled libraries were normalized to 2 nM and denatured using 
0.1 N NaOH before sequencing. Flowcell cluster amplification and sequencing 
were performed according to the manufacturer's protocols using either the HiSeq 
2000 or HiSeq 2500. Each run was a 101-bp paired-end with an eight-base index 
barcode read. Data were organized using the Broad Institute Picard Pipeline which 
includes de-multiplexing and lane aggregation. 

Blood specimen processing. Serological analysis. We analysed 210 serum samples 
for expression of ANCA, ASCA, anti-OmpC, and anti-CBirl by ELISA as previ- 
ously described®”**. Antibody levels were determined and the results expressed 
as ELISA units (EU/ml), which are relative to laboratory standards consisting of 
pooled, antigen-reactive sera from of patients with well-characterized disease. 
DNA isolation from whole blood. DNA was extracted using Chemagic MSM I with 
the Chemagic DNA Blood Kit-96 from Perkin Elmer. The kit combines chemical 
and mechanical lysis with magnetic bead-based purification. Whole blood samples 
were incubated at 37 °C for 5-10 min to thaw. The blood was then transferred to a 
deep well plate with protease and placed on the Chemagic MSM I. The following 
steps were automated on the MSM I. 

M-PVA magnetic beads were added to the blood and protease solution. Lysis 
buffer was added to the solution and vortexed to mix. The bead-bound DNA was 
then removed from solution using a 96-rod magnetic head and washed in three 
ethanol-based wash buffers to eliminate cell debris and protein residue. The beads 
were then washed in a final water wash buffer. Finally, the beads were dipped in 
elution buffer to resuspend the DNA. The beads were then removed from solution, 
leaving purified DNA eluate. The resulting DNA samples were quantified using a 
fluorescence-based PicoGreen assay. 

Host exome sequencing. Ninety-two host exomes were sequenced from DNA 
extracts using previously published methods”. Whole-exome libraries were con- 
structed and sequenced on an Illumina HiSeq 4000 sequencer with 151-bp paired- 
end reads. Output from Illumina software was processed by the Picard pipeline to 
yield BAM files containing calibrated, aligned reads. 

Library construction. Library construction was performed as described® with 
some slight modifications. Initial genomic DNA input into shearing was reduced 
from 3 1g to 50 ng in 10 1l solution and enzymatically sheared. In addition, for 
adaptor ligation, dual-indexed Illumina paired end adapters were replaced with 
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palindromic forked adapters with unique 8-base index sequences embedded 
within the adaptor and added to each end. 

In-solution hybrid selection for exome enrichment. In-solution hybrid selection was 
performed using the Illumina Rapid Capture Exome enrichment kit with 38 Mb 
target territory (29 Mb baited). The targeted region includes 98.3% of the intervals 
in the Refseq exome database. Dual-indexed libraries were pooled into groups of 
up to 96 samples before hybridization. The liquid handling was automated on a 
Hamilton Starlet. The enriched library pools were quantified using PicoGreen after 
elution from streptavadin beads and then normalized to a range compatible with 
sequencing template denature protocols. 

Preparation of libraries for cluster amplification and sequencing. Following sample 
preparation, the libraries prepared using forked, indexed adapters were quanti- 
fied using quantitative PCR (purchased from KAPA biosystems), normalized to 
2 nM using the Hamilton Starlet Liquid Handling system, and pooled by equal 
volume using the Hamilton Starlet Liquid Handling system. Pools were then dena- 
tured using 0.1 N NaOH. Denatured samples were diluted into strip tubes using 
the Hamilton Starlet Liquid Handling system. 

Cluster amplification and sequencing. Cluster amplification of the templates was 
performed according to the manufacturer’s protocol (Illumina) using the Iumina 
cBot. Flow cells were sequenced on HiSeq 4000 Sequencing-by-Synthesis Kits, then 
analysed using RTA2.7.3 

Host genetic data processing. Host genetic exome sequence data were processed 
using the Broad Institute sequencing pipeline by the Data Sciences Platform 
(Broad Institute). This was done in three steps: pre-processing (including reads 
mapping, alignment to a reference genome and data cleanup), variant discovery 
(including per-sample variant calling and joint genotyping), and variant filtering 
to produce callset ready for downstream genetic analysis, using Genome Analysis 
Toolkit (GATK) (detailed documentation at https://software.broadinstitute.org/ 
gatk/documentation/). 

Reduced representation bisulfite sequencing. Reduced representation bisulfite 
sequencing (RRBS) libraries were prepared for 221 biopsies and 228 blood samples 
as described previously”’ with modifications detailed below. In brief, genomic 
DNA samples were quantified using a Quant-It dsDNA high sensitivity kit 
(ThermoFisher, Q33120) and normalized to a concentration of 10 ng/l. A total 
of 100 ng of normalized genomic DNA was digested with MspI in a 20-1l reaction 
containing 1 jl MspI (20 U/l) (NEB, RO106L) and 2 il of 10x CutSmart Buffer 
(NEB, B7204S). MspI digestion reactions were then incubated at 37°C for 2h 
followed by a 15 min incubation at 65°C. 

Next, A-tailing reactions were performed by adding 1 sl dNTP mix (containing 
10 mM dATP, 1 mM dCTP and 1 mM dGTP) (NEB, N0446S), 1 ul Klenow 3/-5! 
exo’ (NEB, M0212L) and 1 til 10x CutSmart Buffer in a total reaction volume of 
30 pl. A-tailing reactions were then incubated at 30°C for 20 min, followed by 37°C 
for 20 min, followed by 65°C for 15 min. 

Methylated Illumina sequencing adapters”’ were then ligated to the A-tailed 
material (30 1l) by adding 1 jl 10x CutSmart Buffer, 5 jl 10 mM ATP (NEB, 
P0756S), 1 pl T4 DNA Ligase (2,000,000 U/ml) (NEB, M0202M) and 2 i:l methyl- 
ated adapters in a total reaction volume of 40 jl. Adaptor ligation reactions were 
then incubated at 16°C overnight (16-20 h) followed by incubation at 65°C for 
15 min. Adaptor ligated material was purified using 1.2 volumes of Ampure 
XP according to the manufacturer’s recommended protocol (Beckman Coulter, 
A63881). 

Following adaptor ligation, bisulfite conversion and subsequent sample purifi- 
cation were performed using the QIAGEN EpiTect kit according to the manufac- 
turer’s recommended protocol designated for DNA extracted from FFPE tissues 
(QIAGEN, 59104). Two rounds of bisulfite conversion were performed yielding a 
total of 40 tl bisulfite-converted DNA. 

In order to determine the minimum number of PCR cycles required for final 
library amplification, 50 jl PCR reactions containing 3 11 bisulfite-converted DNA, 
541 10x PfuTurbo Cx hotstart DNA polymerase buffer, 0.5 jl 100 mM dNTP (25 mM 
each dNTP) (Agilent, 200415), 0.5 il Illumina TruSeq PCR primers (25 1M each 
primer)” and 1 jl PfuTurbo Cx hotstart DNA polymerase (Agilent, 600412) were 
prepared. Reactions were then split equally into four separate tubes and thermo- 
cycled using the following conditions: denature at 95°C for 2 min followed by X 
cycles of 95°C for 30 s, 65°C for 30 s, 72°C for 45 s (where X number of cycles = 11, 
13, 15 and 17), followed by a final extension at 72°C for 7 min. PCR products were 
purified using 1.2x volumes of Ampure XP and analysed on an Agilent Bioanalyzer 
using a High Sensitivity DNA kit (Agilent, 5067-4626). Once the optimal number 
of PCR cycles was determined, 200-|11 PCR reactions were prepared using 24 il 
bisulfite-converted DNA, 20 j1l 10x PfuTurbo Cx hotstart DNA polymerase 
buffer, 2 11 100 mM dNTPs (25 mM each), 2 11 Illumina TruSeq PCR primers 
(25 {tM each) and 4 l PfuTurbo Cx hotstart DNA polymerase with the thermal 
cycling conditions listed above. PCR reactions were purified using 1.2 x volumes of 
Ampure XP according to the manufacturer’s recommended protocol and analysed 
on an Agilent Bioanalyzer using a High Sensitivity DNA kit. 


RRBS sequencing produced an average of 15.0M reads (s.d. 4.0M reads) over 

all 504 samples, with 448 (88.9%) samples exceeding 10M reads. Samples were 
analysed with Picard 2.9.4 using default parameters, resulting in a mean alignment 
rate to the human genome hg19 of 95.1%. Mean CpG coverage was 8.9 x (s.d. 
2.1%). As expected, 99.9% (s.d. 0.02%) of non-CpG bases and 49.8% (s.d. 2.8%) 
of CpG bases were converted. 
Data handling. Informatics for microbial community sequencing data. For metage- 
nomes and metatranscriptomes, sequencing reads from each sample in a pool were 
demultiplexed based on their associated barcode sequence using custom scripts. Up 
to one mismatch in the barcode was allowed provided it did not make assignment 
of the read to a different barcode possible. Barcode sequences were removed from 
the first read as were terminal Gs from the second read that may have been added 
by SMARTScribe during template switching. 

Taxonomic and functional profiles were generated with the bioBakery metaom- 
ics workflow”! v0.9.0 (http://huttenhower.sph.harvard.edu/biobakery_work- 
flows). In brief, reads mapping to the human genome were first filtered out using 
KneadData 0.7.0. Taxonomic profiles of shotgun metagenomes were generated 
using MetaPhlAn2” v2.6.0, which uses a library of clade-specific markers to pro- 
vide pan-microbial (bacterial, archaeal, viral, and eukaryotic) profiling (http:// 
huttenhower.sph.harvard.edu/metaphlan2). Functional profiling was performed 
by HUMAnN2”? v0.11.0 (http://huttenhower.sph.harvard.edu/humann2). 
HUMAnN2 constructs a sample-specific reference database from the pangenomes 
of the subset of species detected in the sample by MetaPhlAn2 (pangenomes are 
precomputed representations of the open reading frames of a given species”*). 
Sample reads are mapped against this database to quantify gene presence and 
abundance on a per-species basis. A translated search is then performed against a 
UniRef-based protein sequence catalogue”° (UniRef release 2014_07) for all reads 
that fail to map at the nucleotide level. The result are abundance profiles of gene 
families (UniRef90s), for both metagenomics and metatranscriptomics, stratified 
by each species contributing those genes, and which can then be summarized to 
higher-level gene groupings such as ECs or KOs. 

Sample counts in Fig. 1b represent the numbers of raw products available. To 
ensure a reasonable read depth in each sample, only samples (metagenomes and 
metatranscriptomes) with at least 1 million reads (after human filtering) and at 
least one non-zero microbial abundance detected by MetaPhlAn2 were used in 
downstream analyses (Fig. 1c and later). In total, this was 1,595 metagenomic 
and 818 metatranscriptomic samples. Principal coordinates plots were generated 
with the cmdscale function in the R package stats. Visualizations were principally 
generated using ggplot2”. 

Species-level metaomic functional profile summaries. Functional profiles per clade 
(typically species) were further quantified by summing the total sum-normalized 
stratified abundance attributed to each organism in the HUMAnN2 functional 
profiles from both metatranscriptomics and metagenomics. For metatranscrip- 
tomic analyses, the expression ratio for the species was then also defined as the 
ratio between these sums. 

Metaproteomic gene family functional profiles. Gene family profiles were gener- 
ated from metaproteomic peptides using UniRef90 identifiers by mapping pep- 
tide sequences to the Diamond-annotated reference in HUMAnN2 v0.11.0 with 
v0.8.22.8477. Each peptide was mapped to the UniRef90 with the highest per cent 
identity (minimum 90% match). 

Statistical methods and association testing. PERMANOVA and Mantel tests. 
Omnibus testing was performed on Bray-Curtis dissimilarity matrices from MGX, 
MTX, MPX, and biopsy measurements. Functional profiles (MGX, MTX, and 
MPX) were first summarized to the KO level using HUMAnN2. Profiles were first 
normalized before calculation of dissimilarities. Dietary distance matrices were 
calculated by ordering the dietary intake frequencies from less to more frequent, 
assigning integers to these levels, and calculating the Manhattan distance. 

Quantifications of covariation between measurement types in Fig. le were done 
using Mantel tests. To quantify cross-sectional (‘“inter-individual’) covariation, we 
first produced an average profile for each subject by taking the feature-wise mean 
over all samples from the subject. Subject-subject dissimilarity matrices were then 
generated and compared using the mantel.rtest function in the R package ape4. To 
quantify longitudinal covariation, we first generated the complete sample-sample 
dissimilarity matrix, but only calculate the Mantel test statistic (the Pearson corre- 
lation between distances) from distances between samples from the same subject. 
Significance in this case was assessed using a permutation test with permutations 
limited within-subject. 

Quantifications of variance explained in Fig. 1f were calculated using 
PERMANOVA with the adonis function in the R package Vegan’*. Apart from 
the All row in Fig. 1f, the total variance explained by each variable was calculated 
independently of other variables (that is, as the sole variable in the model) to avoid 
issues related to variable ordering, and should thus be considered the total variance 
explainable by that variable. To account for the repeated measures present for all 
measurement types tested, relevant permutations were performed blocked within 


subject for variables that change over time (medication, biopsy location, inflam- 
mation status, and dysbiosis). Meanwhile, variables that were constant (or change 
slowly enough to be considered constant) across samples from the same subject 
(age, sex, body mass index (BMI), race, recruitment site, diagnosis, and disease 
location) were first permuted across subjects and samples were relabelled with 
the variable from their permuted subject. To determine the significance of models 
including inter-individual variance (the Subject and All rows), permutations were 
performed freely. For subjects with incomplete records for BMI, we imputed the 
mean BMI of the remaining population. The All row is the total variance explained 
when including all other variables in the model. 

Differential microbiome feature abundance. Differential abundance (DA) analysis 
of all microbial measurement types (except for viruses, which were modelled as 
presence/absence binary features) were tested as follows. First, an appropriate 
transformation/normalization method was applied: arcsine square-root 
transformation for microbial taxonomic and functional relative abundances, 
log transformation (with pseudo count 1 for zero values) for metabolite profiles 
and protein abundances, and log transform with no pseudocount for expression 
ratios (non-finite values removed). Transformed abundances were then fit with the 
following per-feature linear mixed-effects model: 


feature ~ (intercept) + diagnosis + diagnosis/dysbiosis + antibiotic use 


+consent age + (1|recruitment site) + (1|subject) 


That is, in each per-feature multivariable model, recruitment sites and subjects 
were included as random effects to account for the correlations in the repeated 
measures (denoted by (1 | recruitment site) and (1 | subject), respectively) and 
the transformed abundance of each feature was modelled as a function of diag- 
nosis (a categorical variable with non-IBD as the reference group) and dysbiosis 
state as a nested binary variable (with non-dysbiotic as reference) within each IBD 
phenotype (UC, CD, and non-IBD), while adjusting for consent age as a continuous 
covariate, and antibiotics as as binary covariate. Pearson's residual values from the 
above linear mixed effects models were retained for use in subsequent analyses 
(see ‘Cross-measurement type interaction testing’). 

Fitting was performed with the nlme package in R” (using the Ime function), 
where significance of the association was assessed using Wald’s test (except for 
viruses, where a logistic random effects model was considered with the glmer 
function from the mer R package). Nominal P values were adjusted for multiple 
hypothesis testing with a target FDR of 0.25. In order to reduce the effect of zero- 
inflation in microbiome data, features with no variance or with >90% zeros were 
removed before fitting linear models. In addition, a variance filtering step was 
applied to remove features with very low variance. To further remove the effect of 
redundancy in KO gene family abundances (explainable by at most a single taxon), 
only features (both DNA and RNA) with low correlation with individual microbial 
abundances (Spearman correlation coefficient <0.6) were retained. 

Differential host gene expression. Differentially expressed human genes between 
disease groups were quantified using a quasi-likelihood negative binomial gener- 
alized log-linear model (glmQLFit), implemented in the edgeR package in R®°°*!, 
Analysis was performed separately for each section of the intestine on genes with at 
least 2 CPM (counts per million) in 10 or more samples, with significance threshold 
FDR P < 0.05 and >1.5 log-fold change. Gene enrichment analysis was performed 
on differentially expressed genes against the KEGG database using a one-sided 
hypergeometric test in the package limma®. 

Associations with host gene expression. Associations between host gene expression 
and biopsy taxonomic profiles were assessed using partial Spearman correlation, 
accounting for BMI, age at consent, sex and diagnosis. Association testing was 
performed for each biopsy location independently, as biopsy location was shown 
to heavily influence expression profiles (Figs. 1f, 4a, b). This simpler method was 
used rather than the more complex procedure outlined above for microbial meas- 
urement types since host gene expression, once filtered by biopsy location, do 
not have the same repeat measures problem as the microbial measurement types, 
allowing a simpler test. 

Genetic associations. Genetic principal components for IBDMDB subjects 
as well as 1,000 Genomes subjects*! were calculated for a set of independ- 
ent SNPs overlapping between the two data sets and pruned on the basis of 
linkage disequilibrium (LD). Pruning was first performed in HMP2 using the 
-indep-pairwise 1500 150 0.1 command in PLINK® by calculating LD (r’) for 
each pair of SNPs within a window of 1,500 SNPs, removing one of a pair of 
SNPs if r? > 0.1 and repeating this procedure by shifting the window 150 SNPs 
forward. We then used the 1,000 Genomes reference phase 3 version 5a data 
for 2,504 participants (http://bochet.gcc.biostat.washington.edu/beagle/1000_ 
Genomes_phase3_v5a) to merge with the HMP2 pruned data, resulting in 7,227 
overlapping independent SNPs. Using these, we performed genome-wide esti- 
mation of identity-by-descent allele sharing on the combined data set using 
the -genome function in PLINK, followed by calculation of genetic principal 
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components using the —cluster-mds-plot function for the first two principal 
components (Fig. 1a). 

For association analyses, we used first 20 genetic principal components as 
covariates, obtained from the same identity-by-descent sharing matrix using the 
-cluster-pca 20 function. We targeted associations in five loci that had strong 
previously reported associations with IBD and/or have been implicated in micro- 
bial interactions****, To avoid confounding by ancestry, we restricted the analysis 
to subjects of European ancestry, excluding eight subjects with exomes availa- 
ble from other ancestral backgrounds. When available, we used reported SNPs 
that had minor allele frequency of at least 5% and Hardy-Weinberg equilibrium 
P<5x 10-%. Ifnot, we used close proxies (LD 7? < 0.8 in CEU population using 
1,000 genomes phase 3 version 5 reference via http://analysistools.nci.nih.gov/ 
LDlink (MST1, FUT2, IRGM, NKX2-3) or SNPs from the gnomAD browser at 
http://gnomad.broadinstitute.org (PTGER4)). 

We used the following linear mixed effect model with the SNP as a predictor var- 
iable, coded with an additive genetic model with the outcome as the arcsine-square 
root transformed microbial relative abundance measured from stool metagenomes. 
Age, sex, antibiotic and immunosuppressant use, and the first 20 genetic principal 
components (PCs) were fitted as covariates with subjects as the random effect: 


taxon ~ intercept + SNP + antibiotic use + sex + age 
+recruitment site + PC1—PC20 + (1|subject) 


Optimization was performed using the Ime function (from the nlme R package), 
with P values calculated using the Wald test. 

Associations between the rs1042712 SNP of the LCT locus®? and self-reported 
milk intake from dietary recall forms were tested using the same mixed effect 
model. Reported dairy intake options were assigned the following numeric values 
for regression: 1 (‘yesterday, 3 or more times’), 2 (‘yesterday, 1 to 2 times’); 3 (‘within 
the past 2 to 3 days’), 4 (‘within the past 4 to 7 days’), and 5 (‘did not consume in 
last 7 days’). 

Density ridgeline plots of differentially abundant features. To visualize the abun- 
dances of features that showed significantly different abundances by one of the tests 
above (Fig. 2, Extended Data Fig. 4), the bandwidth for kernel density estimation 
was selected independently for the portion of each feature above the detection limit 
(non-‘zero’) using the Sheather and Jones method”. Density estimates were scaled 
such that the maximum density for the plot spanned the distance between base- 
lines for a given disease group. Samples below the detection limit are represented 
as barplots on the left, where a bar that spans the distance between baselines for 
a disease group represents 100% zeros. Density estimates were then additionally 
scaled by the fraction of non-zero samples such that relative differences in densities 
between groups with differing fractions of zeros are accurately represented. For 
both density estimates and fraction of zeros, samples were weighted by the inverse 
of the number of samples obtained from that subject, to avoid biasing estimates 
towards subjects with more densely sampled time series. 

Dysbiosis analyses. Dysbiosis score. To identify samples with highly divergent 
(dysbiotic) metagenomic microbial compositions, as a complement to baseline 
disease diagnosis, we defined a dysbiosis score based on Bray-Curtis dissimilarities 
to non-IBD metagenomes. First, a ‘reference set’ of samples was constructed from 
non-IBD subjects by taking all samples after the 20th week after the subject's first 
stool sample. This was chosen because a subset of the non-IBD subjects at the 
start of their respective time series may not yet have overcome any gastrointestinal 
symptoms that triggered the initial visit to a doctor, though these were ultimately 
not caused by IBD. The dysbiosis score of a given sample was then defined as the 
median Bray—Curtis dissimilarity to this reference sample set, excluding samples 
that came from the same subject (Fig. 2c). 

To identify samples that were highly divergent from the reference set, we thresh- 
olded the dysbiosis score at the 90th percentile of this score for non-IBD samples. 
This therefore identifies samples with a feature configuration that has a less than 
10% probability of occurring in a participant without IBD. By this measure, 272 
metagenomes were classified as dysbiotic. Samples from participants with CD 
or UC were overrepresented in the dysbiotic set, with 24.3% and 11.6% of their 
samples classified as dysbiotic, respectively. As expected, these samples also tended 
to locate in the extremes of the taxonomic ordination based on metagenomes 
(Extended Data Fig. 3b, c). Dysbiosis was unevenly distributed among subjects 
(Extended Data Fig. 3d), with some subjects remaining dysbiotic for all or most of 
their time series, while others remained non-dysbiotic for their entire time series. 

To lend additional support to the definition of dysbiosis (that is, as outliers by 
one type of microbiome profile), we tested the concordance between dysbiosis clas- 
sifications made using the same statistical definition, but applied to metabolomic 
rather than taxonomic profiles. That is, we defined a metabolomic dysbiosis score 
as the median Bray—Curtis dissimilarity of one metabolomic profile to the non-IBD 
metabolomic profiles (after the 20th week), and defined the dysbiosis threshold 
as the 90th percentile of this distribution among non-IBD metabolomic profiles. 
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We then compared these dysbiosis classifications with those from the nearest 
metagenomic sample (up to two weeks, see ‘Cross-measurement type temporal 
matching’) and found that dysbiotic samples identified by metagenomics were 4.6 
times more likely to be dysbiotic by metabolomics (Fisher's exact P= 5.9 x 107°), 
showing that dysbiosis measurements are highly consistent across measurement 
types. 

To test the sensitivity of the dysbiosis classification to the choice of reference 
data set, we also performed the dysbiosis classification using the HMP1-I] stool 
samples!” as the reference sample set instead of the non-IBD samples. The result- 
ing dysbiosis scores (Extended Data Fig. 3e) were highly concordant (Spearman 
p = 0.86; P < 2.2 x 107°), as were the dysbiosis classifications (odds ratio of 56; 
Fisher’s exact P < 2.2 x 10-1). This shows that, despite the inclusion of subjects 
with other conditions in the non-IBD group here, as well as large differences in 
measurement technologies between the data sets, the dysbiosis classification is 
highly robust. Furthermore, 43 out of 426 (10.1%) of non-IBD samples were clas- 
sified as dysbiotic using the HMP1-II samples as reference, falling remarkably 
close to the 10% expected by the definition and showing that the enrichment of 
IBD samples in the dysbiotic set is not simply a consequence of the definition. 
Dysbiosis durations and intervals. Samples of the dysbiosis durations and intervals 
were obtained by taking the difference in time between stool metagenomes in 
which the dysbiosis state changes, that is, the time from the first dysbiotic sam- 
ple in an excursion into dysbiosis until the next non-dysbiotic sample was taken 
as one sample of the dysbiosis duration distribution. If the subject’s time series 
ended before this transition occurred, this resulted in a ‘censored’ duration or 
interval (Fig. 2e). Estimates of the durations of and time between dysbioses were 
then obtained from a censored maximum likelihood estimator for the mean of 
an exponential distribution. This incorporates the censored durations and inter- 
vals into the estimate to avoid underestimating these durations owing to limited 
observation times. 

Association of dysbiosis with disease location. We tested for a relationship between 
the Montreal disease location classification in CD and periods of dysbiosis to 
ensure that dysbiosis was not simply detecting different disease locations. For 
this, an F-test of no significance was used with a Kenward—Roger approximation 
of degrees of freedom” in a logistic random effects regression that models dysbi- 
osis as the binary outcome with subjects as a random effect and disease location 
as covariate, as implemented in the function glmer in the R package Ime4. Only 
individuals with CD were considered. 

Temporal analyses. Power-law fits to Bray-Curtis dissimilarities. Power-law fits to 
species-level metagenomic, metatranscriptomic, and metabolite Bray-Curtis dis- 
similarities (Fig. 3a, Extended Data Fig. 5a) were performed by fitting a power-law 
curve with free intercept by least-squares using the neldermead function from the 
R package nloptr. Significance was assessed using an F-test to compare the fit model 
with a flat line. Significance of the difference of the fit between disease groups was 
also assessed using an F-test, comparing a model jointly fit to both disease groups 
with separate fits to each group. 

Microbiome shifts. A microbiome ‘shift’ was defined as having occurred between 
two consecutive time points from the same person if the Bray—Curtis dissimilarity 
between their profiles was more likely to have come from a comparison between 
samples from different people rather than from the same person (Extended Data 
Fig. 5b). As an individual's microbial profile naturally changes over time!”, the 
Bray—Curtis threshold at which this occurs will increase with the time difference 
between samples. To determine these thresholds, kernel density estimates were 
generated for the distribution of Bray—Curtis dissimilarities between profiles from 
different individuals without IBD and between samples from the individuals with- 
out IBD at a range of time differences, using the density function in R. The point at 
which the inter-individual density estimate exceeded the intra-individual density 
estimate was then taken as the threshold to define a ‘shift with the additional con- 
straint that this must be a monotonically increasing function of the time difference 
between samples (Fig. 3a). Metabolomic shifts were defined similarly, although 
owing to the more sparse temporal sampling of the metabolomics data (Extended 
Data Fig. 1) and lack of a strong upward trend in Bray—Curtis dissimilarities with 
time difference (Fig. 3a), only a single threshold was used based on the distribu- 
tion of Bray—Curtis dissimilarities from comparisons within-subject over time in 
participants without IBD. Heatmaps of shift differences were generated using the 
R package pheatmap 1.0.10°°. 

Longitudinal multi-omic study design. Owing to the large variation in microbial 
profiles between people (Supplementary Fig. 2), with relatively smaller variation 
within subjects over time (Fig. 3a), longitudinal study designs have the potential to 
be higher-powered than purely cross-sectional studies, particularly in their ability 
to self-control individuals and to capture transitions between phenotypes (or after 
interventions) of interest”*. Here, although some subjects remained in a dysbiotic 
state far longer than others (Extended Data Fig. 3d), the heterogeneity observed 
was enough to discover differences in measurement types other than where 
dysbiosis was defined. For example, subjects who had unusual, disease-associated 


microbiome taxonomic profiles also proved to have generally shifted serological 
and/or metabolomic profiles at corresponding time points. Noting these dysbi- 
otic time points offered a complementary set of differences to what was visible 
cross-sectionally (Fig. 2). 

Among microbially related measurements, metabolomics provided the most 
robust separation between disease and dysbiosis groups, possibly because it inte- 
grated a combination of host, microbial, and dietary differences (Fig. le, f). Thus, 
despite the challenges presented by untargeted metabolomics, such as unknown 
compound identification, the presence of redundant and background signals, and 
the complexity of the stool matrix, this measurement type often provides a robust 
characterization of subjects, their disease state, and individual small molecules that 
interface between host and microbiome. Conversely, the current state of viromics 
assays and reference databases makes this more challenging to work with, although 
the importance of the virome in microbial community dynamics” will make this 
an extremely interesting feature space going forward. 

All longitudinal microbial measurement types showed significant variation 

within two weeks (Fig. 3a, Extended Data Fig. 5a). This suggests that even higher 
sampling rates may be needed to catch relevant microbial variation, particularly 
before the onset of more severe clinical symptoms. Our sampling protocol also did 
not account for other potential sources of within-subject variation, such as transit 
time or the precise portion of each whole stool that was sampled. In this data set, 
we thus cannot distinguish between temporal and technical variation within sub- 
jects. To mitigate the higher costs associated with processing additional samples, 
a future study aiming to achieve higher temporal resolution might proceed in two 
phases, thanks to the coupling between data types (Fig. le): first, collect samples 
at a higher frequency, and process these with metagenomic or 16S sequencing. 
Then proceed with more expensive and detailed data generation only for samples 
taken specifically around periods of interest, such as periods of dysbiosis identified 
during the first stage. To this end, the sampling rate can also be tuned to target a 
particular probability of missing a dysbiosis period. 
Integrative analyses. Lenient cross-measurement type temporal matching. For 
comparison between multiple measurement types, we first constructed sets of 
samples corresponding to the same biosample across data sets. However, exact 
matches were not always possible, for example, owing to specimen limitations 
during sample selection (see ‘Sample selection’) or to samples that failed quality 
control. In these cases, matching sample sets across data sets were created using 
nearby samples. During this process, a degree of leniency was allowed in the match- 
ing, allowing samples up to a given time difference (two or four weeks) apart to 
be ‘matched. To perform this matching (sample numbers in Fig. 1c and Extended 
Data Fig. 1b), we used the following algorithm. 

For a given set of measurement types to be matched for a particular subject, find 

the first time window in which all measurement types have at least one produced 
sample. Next, within this window, find the time point with the most measurement 
types produced; in the event of a tie, select earlier time points. Finally, for each 
data set, select the nearest sample to this target time point that is within the time 
window, breaking ties towards earlier time points. This set of selected samples 
comprises one ‘matched’ sample. For each data set, all samples up to and includ- 
ing the later of the selected sample or the target time point are then disregarded 
from future consideration (and thus any sample will be included in at most only 
one matched time point). This process is repeated for each subject until no such 
window exists. 
Cross-measurement type interaction testing. Significant associations between fea- 
tures from multiple measurement types were identified using two different models: 
an ‘unadjusted’ model of associations that are mainly due to dysbiosis, and an 
‘adjusted’ model that emphasizes associations in addition to those that are dysbiosis- 
linked. Associations in both cases incorporated features from ten data sets: 
metagenomic species, species-level transcription ratios, functional profiles at the 
EC level (MGX, MTX and MPX), metabolites, host transcription (rectal and ileal 
separately), serology and faecal calprotectin. 

To detect adjusted associations, we first obtained residuals of features from the 
above data types fit to a mixed-effects model including subjects as random effects 
as above for the differential abundance testing (or a simple linear model without 
the random effects when only baseline samples were used) and adjusting for age, 
sex, diagnosis, dysbiosis status, antibiotic and immunosuppressant use, and bowel 
surgery status. Residuals from subjects with fewer than four samples in their time 
series for the measurement type were ignored, and for measurements with no 
longitudinal samples (for example, serology and host transcriptomics measure- 
ments), the residualization was repeated with the first available samples using a 
simple linear model without random effects. This allowed the identification of 
significant (FDR P < 0.05 for most measurement types, P < 0.25 for serology) 
Spearman associations using HAIIA 0.8.17 (hierarchical all-against-all association 
testing, http://huttenhower.sph.harvard.edu/halla, Supplementary Table 35). As 
subject-specific random effects and covariate effects were removed from these 
residuals, the resulting correlations are likely to be independent of all sources of 


inter-individual variation as well as confounding effects due to the covariates. 
See Fig. 4c and Extended Data Figs. 7-9 for summary visualizations of these results. 
Similarly, unadjusted associations were identified using the same procedure, but 
without including dysbiosis as a covariate (Supplementary Table 36). Network 
visualization was done using Cytoscape” 3.6.0. 

Reporting summary. Further information on research design is available in the 
Nature Research Reporting Summary linked to this paper. 


Data availability 

Protocols and data (both raw and summarized to data type-dependent profiles) are 
available at the IBDMDB website (https://ibdmdb.org), the HMP DACC web portal 
(https://www.hmpdacc.org/ihmp/), and Qiita®” (https://qiita.ucsd.edu/). Sequence 
data are available from SRA BioProject PRJNA398089. Expression data have been 
deposited in the NCBI Gene Expression Omnibus” and is accessible through 
GEO Series accession number GSE111889. Metabolomics data are available at the 
NIH Common Fund’s Metabolomics Data Repository and Coordinating Center 
(supported by NIH grant U01-DK097430) website, the Metabolomics Workbench 
(http://www.metabolomicsworkbench.org), where it has been assigned Project 
ID PR000639. Mass spectrometry proteomics data have been deposited to the 
ProteomeXchange Consortium via the PRIDE” partner repository with the data 
set identifiers PXD008675 and 10.6019/PXD008675. Reprints and permissions 
information is available at www.nature.com/reprints. 


Code availability 

Bioinformatics workflows for metagenomics and metatranscriptomics data are 
available at https://bitbucket.org/biobakery/hmp2_workflows. Analysis scripts 
are available at https://bitbucket.org/biobakery/hmp2_analysis. 
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Extended Data Fig. 1 | Distribution of sample and measurement types 
and timing. a, Measurements available over time for each IBDMDB 
participant. b, Number of identical or closely-aligned time points available 
for which each measurement type has been generated (see Methods). 


c, Distributions of the number of processed samples per subject, stratified 
by disease and measurement type. d, Distributions of time intervals 
between consecutive samples for each measurement type. 
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Extended Data Fig. 2 | Within-individual stability is a major driver 

of microbiome differences across measurement types. a, PCoA and 
t-SNE embeddings based on Bray—Curtis dissimilarity matrices from 
stool species abundances, transcripts, proteins, and metabolites. Marginal 
densities are shown for the PCoAs that show disease separation for 

some measurements. In the t-SNEs, each subject has been assigned a 
different hue, showing that small clusters generally represent individuals’ 
time courses, as inter-individual differences are the greatest driver 

of microbiome variation across measurement types (Fig. 1f). Sample 
counts are shown in Fig. 1b. b, Distributions of correlations between 
functional profiles, captured as UniRef90'™ gene family abundances”’, 


PC1 (21%) 


PCo 1 (15.4%) 


measured from paired metagenomes, metatranscriptomes, and 
metaproteomes (see Methods). c, Human transcriptional expression was 
mostly determined by biopsy location rather than IBD phenotype and 
inflammation (n = 249 samples from 91 subjects). Ordination shows PCA 
on gene expression levels normalized by library sizes and represented as 
CPM. Ellipses indicate 95% confidence regions for the indicated sample 
types. d, Principal coordinates plot (Bray—Curtis on OTU profiles) of 
community profiles from biopsy samples shows that mucosal microbial 
communities do not differ significantly by biopsy location (shape), unlike 
human gene expression in epithelial tissue (Fig. 4b). 
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Extended Data Fig. 3 | Patient-reported, molecular, and microbial 
disease activity measures. a, Relationships between six measures of 
disease activity: the patient-reported HBI in CD, patient-reported SCCAI 
in UC, faecal calprotectin, the fractions of human reads from stool MGX 
and MTX, and a dysbiosis score defined here as departures from control 
population microbiome configurations (Fig. 2c). Rho values are Spearman 
correlations with ties broken randomly. Linear regression is shown (red 
line) with 95% confidence bound (shaded). Sample counts are presented 
in the title bar, though sample counts for a particular correlation may be 
less as samples must be paired. b, c, PCoA based on metagenomic species- 
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level Bray-Curtis dissimilarities (n = 1,595 samples from 130 subjects), 
indicating dysbiosis score (b) and whether the sample was defined as 
dysbiotic (c). d, Number of dysbiotic samples per participant. Colour 
scheme as in c. e, Relationship between the dysbiosis score, when using the 
HMP1-II gut data set as reference (n = 553 from 249 subjects), compared 
to the non-IBD data set. The threshold for the dysbiosis classification is 
also shown (black lines). The two scores are highly correlated (Pearson 

p = 0.86; two-sided P < 2.2 x 107°), as are the resulting dysbiosis 
classifications (odds ratio of 56; Fisher’s exact test P < 2.2 x 10~!). 

n= 1,595 samples from 130 subjects. 
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Extended Data Fig. 4 | Significant microbial and metabolic Supplementary Table 15). c, Top 10 differentially abundant acylcarnitines 
perturbations during taxonomic dysbioses. a, Alpha diversity in dysbiosis (n = 546 samples from 106 subjects; Wald test). Carnitine and 
(Gini-Simpson) as a function of the dysbiosis score for the sample acylcarnitines are more abundant in dysbiotic CD, whereas C20:4 carnitine 
(Pearson correlation -0.60; P < 2.2 x 1071%). n = 1,595 samples from is significantly depleted (Supplementary Table 16). d, Top 10 differentially 
130 subjects. b, Seven (chosen for space constraints) most differentially abundant metabolites during dysbiosis not shown elsewhere in this 
abundant species not shown elsewhere in this manuscript (n = 1,595 manuscript (n = 546 samples from 106 subjects; Wald test; full results in 


samples from 130 subjects; Wald test; see Methods; full results in Supplementary Table 16). 
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Extended Data Fig. 5 | See next page for caption. 


Extended Data Fig. 5 | Detecting shifts in longitudinal microbiome 
multi-omics. a, Distributions of Bray-Curtis dissimilarities as a function 
of time difference between samples for protein profiles, species-level 
transcriptional activity (see Methods), and species-level taxonomy 
(though excluding subjects with dysbiosis at any time point), otherwise as 
in Fig. 3a. Removing subjects with dysbiotic samples removes the extreme 
dissimilarities (near 1) observed in IBD subjects. Boxplots show median 
and lower/upper quartiles; whiskers show inner fences. b, Distribution 

of Bray-Curtis dissimilarities between samples from the same subject, 
two weeks apart versus those from different individuals, allowing us 

to define a ‘shift’ in the microbiome as a change more likely to have 

been drawn from the between-subject distribution than within-subject 
distances (corresponding to Bray-Curtis > 0.54). c, Relative abundance 
differences of the top ten microorganisms that contributed to each of the 
183 detected taxonomic shifts among any two within-subject subsequent 
time points. Shifts are typically reciprocal (that is, losing a microorganism 
and regaining it later, or vice versa), and microorganism with frequent 
high-abundance shifts generally correspond to frequent contributors in 
Fig. 3b. Sample ordering is from a hierarchical clustering using average 
linkage followed by optimal leaf ordering!’. d, As in Fig. 3c, but for E. coli 
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(n = 322 samples from 24 subjects; two-tailed Wilcoxon test of the 
absolute differences in relative abundances between consecutive time 
points P = 2.2 x 10~‘ for non-IBD to UC, and P = 0.029 for non-IBD 

to CD), which is frequently implicated in gut inflammation. e, As in 

b, but showing Bray—Curtis dissimilarities of metabolomic profiles. Here, 
22% (96 out of 440) of sample pairs exceed the shift threshold, whereas 
13% (183 out of 1,413) exceed the threshold in b. If metagenomic profiles 
are sub-sampled to match the metabolomics samples, this increases to 

14% (57 out of 398) of sample pairs, showing that if we increased the 
sampling rate, this measurement type would be likely to shift more than 
the metagenomes. f, As in Fig. 3b, but showing the primary contributors 
to metabolomic shifts, that is, the metabolite with the largest change in 
relative abundance during a shift. Note that other metabolites may still 
experience large changes in abundance (for example, for this reason, urate 
was not a primary contributor to any non-IBD shifts, though large changes 
are visible for one non-IBD individual in Fig. 3e). The full table of detected 
metabolomic shifts is given in Supplementary Table 30. Violin plot shows 
the density of points around that intake frequency; bandwidth chosen 
automatically by Silverman's method'™. 
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Extended Data Fig. 6 | Mucosal communities and human genetics. 

a, Number of biopsy samples available for each biopsy location and 
inflammation status. b, DEGs (Fig. 4a) with newly identified significant 
correlations with OTU abundances in biopsies (partial Spearman 
correlation conditioned on disease status, BMI, age at consent and sex; 
FDR P < 0.05; n = 54 in ileum and n = 52 independent 16S-RNA-seq 
pairs; full table in Supplementary Table 33). c, A limited subset of the 
microbiome trended with genetic variants in targeted testing, including 
the strongest trend shown here of Parabacteroides distasonis with 
genotypes of NKX2-3 (a known IBD-associated locus!°*; boxplots show 
median and lower/upper quartiles; whiskers show inner fences). This is 
the most significant association by P value among all tested associations 
between metagenomic taxa and five known IBD loci (nominal significance 
P = 0.006; no associations passed FDR P < 0.05, mixed effect model 
with age, sex, antibiotic and immunosuppressant use and first 20 genetic 
principal components as covariates while specifying subjects as random 
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effects; Wald test; n = 84 subjects of European ancestry with exomes and 
960 metagenomes; full results in Supplementary Table 34). d, Association 
between rs1042712 SNP in the LCT locus and self-reported milk intake 
from dietary recall. Self-reported short-term milk intake (from dietary 
recalls accompanying stool samples) was significantly associated with 
the count of C alleles (29.8% allele frequency) at rs1042712 in the LCT 
gene locus using a linear mixed effect model accounting for age, sex, first 
20 genetic principal components and with subjects as random effects 

(P = 0.028, linear mixed effect regression with Wald test, see Methods). 
All available data are plotted for unique subjects of European ancestry 
with exome data (per-genotype subject count (GG/GC/CC):50/26/8). 
Differences between IBD and non-IBD groups are not statistically 
significant (odds ratio 0.27; 95% CI 0.05-1.33; P = 0.10; n = 84 subjects 
of European ancestry with exomes and 960 dietary surveys; model: IBD 
(yes|no) ~ intercept + SNP + sex + age + PC1-PC20). 
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Extended Data Fig. 7 | Microbial and host-related subsets of the multi- 
omic association networks. a, Subset of the network in Fig. 4c showing 
metagenomic abundances (octagons) and expression levels (hexagons) of 
Subdoligranulum, Roseburia spp. and EF. prausnitzii and their neighbours 
(three functionally associated microbial hubs selected for further 
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investigation based on anti-inflammatory associations in the literature, 
see text). b, The host expression-related subnetwork of the ‘unadjusted’ 
association network (Extended Data Fig. 9, Supplementary Discussion). 
Sample counts in Fig. 1b, c. 
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Extended Data Fig. 8 | Significant covariation among multi-omic 
components of the gut microbiome and host interactors in IBD 
(adjusted). Detailed labelling of the association network in Fig. 4c 
(intended for magnification). The network was constructed from ten data 
sets: metagenomic species, species-level transcription ratios, functional 
profiles at the EC levels (MGX, MTX and MPX), metabolites, host 


transcription (rectal and ileal separately), serology and faecal calprotectin. 


As in Fig. 4c, measurement types were approximately matched in time 
with a maximum separation between paired samples of four weeks. The 
top 300 significant correlations (FDR P < 0.05) among correlations 
between features that were differentially abundant in dysbiosis were 
used to construct the network visualized here (for serology, a threshold 
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of FDR P < 0.25 was used). Nodes are coloured by the disease group in 
which they are ‘high, and edges are coloured by the sign and strength of 
the correlation. For this adjusted network, Spearman correlations were 
calculated using HANA from the residuals of a mixed-effects model with 
subjects as random effects (or a simple linear model without the random 
effects when only baseline samples were used) after adjusting for age, sex, 
diagnosis, dysbiosis status, recruitment site, and antibiotics (see Methods). 
Appropriate normalization and/or transformation for each measurement 
type was performed independently before the model fitting (see Methods). 
Singleton node pairs were pruned from the network. Source associations 
are in Supplementary Table 35, sample counts in Fig. 1b, c. 
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Extended Data Fig. 9 | Significant covariation among multi-omic 
components of the gut microbiome and host interactors in IBD 
(unadjusted). The network was constructed from ten data sets: 
metagenomic species, species-level transcription ratios, functional profiles 
at the EC levels (MGX, MTX and MPX), metabolites, host transcription 
(rectal and ileal separately), serology and faecal calprotectin. As in Fig. 4c, 
measurement types were approximately matched in time with a maximum 
separation between paired samples of four weeks. The top 300 significant 
correlations (FDR P < 0.05) among correlations between features that 
were differentially abundant in dysbiosis were used to construct the 


network visualized here (for serology, a threshold of FDR P < 0.25 was 
used). Nodes are coloured by the disease group in which they are ‘high, 
and edges are coloured by the sign and strength of the correlation. For this 
unadjusted network, Spearman correlations were calculated using HALA 
from the residuals of the same model as in Extended Data Fig. 8, though 
without adjusting for dysbiosis (see Methods). Appropriate normalization 
and/or transformation for each measurement type was performed 
independently before the model fitting (see Methods). Singleton 

node pairs were pruned from the network. Source associations are in 
Supplementary Table 36, sample counts in Fig. 1b, c. 
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Extended Data Table 1 | IBDMDB cohort characteristics 


Cedars-Sinai Emory Cincinnati MGH MGH Ped. Total 

Subjects 33 11 33 38 17 132 

g non-IBD 1 1 9 13 3 i 

= UC 12 3 7 11 5 38 

ra CD 20 7 17 14 9 67 

x Female 24 6 16 15 3 64 

n Male 9 5 17 23 14 68 

American Indian/Alaska Native 1 1 

4 Black or African American 6 2 2 10 

‘= More than one race 3 1 1 5 

iu Other 1 3 4 

White 32 2 30 32 16 112 

Age (years) 45.0 + 13.0 12.6 43.8 12.9429 35.9+14.5 13.94 3.2 27.5+17.3 

[26 - 76] [7 - 17] [6 - 17] [18 - 74] [6 - 17] [6 - 76] 

Height (cm) 169.7+9.5 148.7 + 10.7 157.9 + 15.8 169.4+11.4 164.1 + 20.8 163.5 + 15.5 

[152 - 188] [137 - 158] [112 - 182] [142 - 188] [114.3 - 193] [112 - 193] 

Weight (kg) 66.4 + 13.0 43.94 4.1 65.0 + 23.5 71.8 + 16.6 53.9 + 18.6 65.1 + 20.1 

[54.9 - 87.1] [41.0- 46.8]  [19.3- 115.7] [37 - 100] [19.6 - 75.0] [19.3 - 115.7] 

BMI 28.4 + 10.2 18.8 + 3.2 21.1+6.1 24.4+5.1 19.1444 22.9+7.1 

[19.5 - 50.2] [16.3 - 22.4] [13.5 - 35.6] [19.1 - 40.6] [13.5 - 28.4] [13.5 - 50.2] 

6 1 1 2 8 11 

8 L1+L4 4 1 5 

5 L2 2 2 1 5 

5 L2+L4 1 1 

5 L3 9 1 5 2 2 19 

= L3+L4 1 7 4 12 
a 

rs) N/A 8 3 2 1 14 

ro Ulcerative proctitis 1 1 2 

£ Left-sided UC 2 2 1 5 

5 Extensive UC / Pancolitis 2 1 3 2 1 9 

=) N/A 8 2 1 8 3 22 


Breakdown of the cohort by diagnosis, sex, ethnicity, physical characteristics, and disease location for each clinical site. Numeric data are summarized as mean + s.d., with the range given by 
[min-max], and with blank cells for NA (not applicable). Subject counts for disease locations are subsets of the respective diseases. 
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Describe how sample size was determined. The target sample size calculated for at least n=72 subjects with repeated measures was 
designed to have power of 0.9 to detect 1) between-group differences in taxon abundance 
(repeated measures ANOVA, group F > 0.4), 2) differentially expressed transcripts (Edland’s 
test for a linear mixed model with random slope, d > 0.07), and 3) multi'omic correlations 
(Pearson correlation, r > 0.6). Power calculations incorporated conservative Bonferroni p- 
value correction, with numbers of post-QC microbial features and within-sample correlations 
estimated from previous microbiome studies. 


2. Data exclusions 


Describe any data exclusions. Potential subjects were excluded from the study if they were unable or did not consent to 
provide tissue, blood, or stool, were pregnant, had a known bleeding disorder or an acute 
gastrointestinal infection, were actively being treated for a malignancy with chemotherapy, 
were diagnosed with indeterminate colitis, or had a prior, major gastrointestinal surgery such 
as an ileal/colonic diversion or j-pouch. These criteria were established prior to the study 
start. Samples were filtered based on data type-specific quality control measures. For 
metagenomes and metatrascriptomes, samples were required to have >1M reads and at 
least one species detected by MetaPhlAn2. 


3. Replication 


Describe the measures taken to verify the reproducibility | The study was a large-scale clinical cohort and we did not attempt to replicate all aspects of 

of the experimental findings. sample collection and data generation. However, data and source code for computational 
tools used are available to the public and therefore all of our analysis can be reproduced 
using our methods or re-analyzed using other methods. When possible, we refer to existing 
literature that supports our findings. Multiple pilot studies as well as technical replicates 
covering a subset of samples are also available, and these data were successfully integrated 
into subsequent multi-batch analyses, ensuring that data generation methods produced 
reproducible results. 


4. Randomization 


Describe how samples/organisms/participants were Experimental groups could not be randomized as they depended on diagnosis. Participants 

allocated into experimental groups. were recruited into the three disease groups as available from each of the recruitment sites. 
Upon enrollment, an initial colonoscopy was performed to determine study strata. Subjects 
not diagnosed with IBD based on endoscopic and histopathologic findings were classified as 
“non-IBD” controls, including the aforementioned healthy individuals presenting for routine 
screening, and those with more benign or non-specific symptoms. This creates a control 
group that, while not completely “healthy”, differs from the IBD cohorts specifically by clinical 


IBD status. 
5. Blinding 
Describe whether the investigators were blinded to Samples were collected by clinical staff who were not blinded as they needed to examine 
group allocation during data collection and/or analysis. patients to determine which experimental group they should be allocated to. All data were 


generated by investigators that were blinded to the metadata. Once data were generated, 
computational analysis was performed with all of the necessary clinical information to test 
between groups. 


Note: all in vivo studies must report how sample size was determined and whether blinding and randomization were used. 


6. Statistical parameters 


For all figures and tables that use statistical methods, confirm that the following items are present in relevant figure legends (or in the 
Methods section if additional space is needed). 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement (animals, litters, cultures, etc.) 


l A description of how samples were collected, noting whether measurements were taken from distinct samples or whether the same 
sample was measured repeatedly 


A statement indicating how many times each experiment was replicated 


The statistical test(s) used and whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of any assumptions or corrections, such as an adjustment for multiple comparisons 


O Test values indicating whether an effect is present 
Provide confidence intervals or give results of significance tests (e.g. P values) as exact values whenever appropriate and with effect sizes noted. 


A clear description of statistics including central tendency (e.g. median, mean) and variation (e.g. standard deviation, interquartile range) 


Fal Clearly defined error bars in all relevant figure captions (with explicit mention of central tendency and variation) 


=) 
fed) 
= 
e 
= 
O 
= 
o 
Wn 
© 
fev) 
= 
(@ 
=i 
oO 
Wn 
ar 
@ 
= 
a 
oO 
Nn 
= 
O 
ze} 
oO 
a 
=) 
a 
n 
e 
fev) 
= 
< 


See the web collection on statistics for biologists for further resources and guidance. 


>» Software 


Policy information about availability of computer code 


7. Software 
Describe the software used to analyze the data in this Analysis of mass spectra: MSGF+ software v10072 
study. Sequence processing: Picard 2.9.4 


Metagenome and metatranscriptome profiles generated using bioBakery meta’omics 
workflow v0.9.0 (Preprocessing: KneadData 0.7.0; Taxonomic profiles: MetaPhlAn2 v2.6.0; 
Functional profiles: HUMAnN2 v0.11.0 with UniRef release 2014 07, Diamond v0.8.22.84) 
Viral profiles: VirMAP 

16S analysis: USEARCH v7.0.1090 
Data analysis and plotting: R (main packages: edgeR, nlme, ggplot2, ggridges, pheatmap, 
vegan, tsne), Julia plots package 
Network construction and visualization: HAIIA 0.8.17, Cytoscape 3.6.0 


For manuscripts utilizing custom algorithms or software that are central to the paper but not yet described in the published literature, software must be made 
available to editors and reviewers upon request. We strongly encourage code deposition in a community repository (e.g. GitHub). Nature Methods guidance for 
providing algorithms and software for publication provides further information on this topic. 


>» Materials and reagents 


Policy information about availability of materials 
8. Materials availability 


Indicate whether there are restrictions on availability of Remaining biospecimen aliquots from the project are available by request from the 
unique materials or if these materials are only available | Corresponding author. 
for distribution by a third party. 


9. Antibodies 


Describe the antibodies used and how they were validated Antibodies were used for serology and the ELISAs we performed are not commercially 
for use in the system under study (i.e. assay and species). _ available as kits or reagents (except for secondary antibodies). 
The primary antibodies are patient sera (validation not applicable). 
Conjugated polyclonal secondary antibodies are utilized, clone not applicable: 
Jackson ImmunoResearch Laboratories goat anti-human IgG-alkaline phosphatase; Cat # 
109-056-098; 1:1000 dilution 
Jackson ImmunoResearch Laboratories goat anti-human IgA-alkaline phosphatase; Cat # 
109-055-011; 1:1000 dilution 
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10. Eukaryotic cell lines 
a. State the source of each eukaryotic cell line used. 


b. Describe the method of cell line authentication used. 


c. Report whether the cell lines were tested for 
mycoplasma contamination. 


d. If any of the cell lines used are listed in the database 
of commonly misidentified cell lines maintained by 
ICLAC, provide a scientific rationale for their use. 


>» Animals and human research participants 


No eukaryotic cell lines were used. 
No eukaryotic cell lines were used. 


No eukaryotic cell lines were used. 


No commonly misidentified cell lines were used. 


Policy information about studies involving animals; when reporting animal research, follow the ARRIVE guidelines 


11. Description of research animals 


Provide all relevant details on animals and/or 
animal-derived materials used in the study. 


No animals were used. 


Policy information about studies involving human research participants 


12. Description of human research participants 


Describe the covariate-relevant population 
characteristics of the human research participants. 


Population characteristics are presented in Extended Data Table 1. 
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anone was founded 

100 years agoona 

breakthrough concept of 
the time: that fermented foods 
and the bacteria they contained 
(not yet referred to as probiotics) 
could target the gut and its 
microbiome to bring health to 
all. One century later, Danone 
continues to place the gut and 
its microbiome at the core of 
its health strategy to deliver the 
company's mission “bringing 
health through food to as many 
people as possible”. Today, the 
portfolio of the Danone Essential 
Dairy & Plant-Based division 
encompasses the traditional 
range of fermented dairy 
products and probiotics. The 
portfolio was recently broadened 
to include plant-based products, 
expanding the possibility to 
develop dairy and non-dairy 
ranges of healthy products 
that will contribute to nourish, 
enrich and protect the gut and 
its microbiome. Here we present 
the history of microbiology and 
Danone and reveal our research 
priorities for the coming years. 


DANONE WAS BORN 
DURING THE MICROBIOLOGY 
REVOLUTION 

“The secret to longevity may lie 
in the microbiome and the gut”. 
“Why the gut microbiome is 
crucial for your health"?. These 
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Figure 1. Danone yogurt was packaged and sold in ceramic pots in the early twentieth century. Source: Danone. 


are two examples of headlines 
that were written over the past 
couple of years, but they could 
have been written one hundred 
years ago. By the end of the 
nineteenth century, the field of 
microbiology started to fulfil 

its first promises. Louis Pasteur 
discovered fermentation by 
lactic acid bacteria, which he 
described as beneficial bacteria 
to protect foods from spoilage. 
Despite the traditional use 

of fermentation in foods and 
beverages to improve storage 
time, safety, functionality, 
organoleptic quality and 
nutritional properties, it was 
Pasteur who demonstrated 


Sponsor 


that the lactate produced 
during fermentation inhibited 
the growth of pathogens and 
avoided food spoilage. The 
idea that not all bacteria were 
prejudicial was a breakthrough 
at the time. The scientific world 
was shaken by the promise of 
modern microbiology, which 
saw the rise of vaccines, the 
isolation of beneficial bacteria 
and, later in the century, with the 
discovery in 1928 of penicillin 
by Alexander Fleming, the 
development of antibiotics. 
Microbiology was seen by many 
as uncharted scientific territory 
holding promises to improve 
people's lives. 


retains sole 


responsibility for 


Building on Louis Pasteur's 
legacy, in 1899 Henri Tessier, 
a paediatrician from the 
Institut Pasteur, discovered 
Bifidobacterium bifidum in 
faecal samples of breast-fed 
infants and reported that the 
bacteria could be used to help 
prevent babies from developing 
diarrhoea. The Russian-born 
biologist Elie Metchnikoff, 
recipient of the Nobel prize in 
physiology or medicine in 1908 
for his work on phagocytosis, 
hypothesized that “there is a link 
between the gut flora and the 
senile degeneration’. The gut 
microbiome and its metabolites 
were linked to intestinal processes 


content 
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beyond food digestion and the 
absorption of nutrients. At the 
time, it was a breakthrough 
concept: the bacteria in our 
guts could dictate our fate. 
Metchnikoff also observed 
that food could be used to 
beneficially modulate the gut 
flora, in particular the bacteria 
contained in fermented foods’. 
His hypothesis was that if the 
lactate produced by bacteria 
during fermentation can inhibit 
the growth of harmful bacteria, 
then the same process could 
occur in the gut. Consequently, 
lactic acid bacteria could inhibit 
the ‘putrefaction’ and production 
of toxins in the gut. He worked 
to isolate pure cultures from a 
sample of Bulgarian fermented 
milk, among them Lactobacillus 
bulgaricus and Streptococcus 
thermophilus, the two species 
used today to produce yogurt. 

The history of Danone begins 
in 1916, when Greek businessman 
Isaac Carasso took his wife 
and three children to Spain, his 
ancestral home. Much of Europe 
was experiencing the turmoil of 
the first world war and although 
Spain remained neutral, famine, 
malnutrition and poor living 
standards were present in many 
layers of society. Carasso was 
deeply affected by the levels of 
malnutrition and disease that he 
saw in Spanish children. 

Carasso was open and curious 
about scientific developments 
and he was inspired by 
Metchnikoff's research into 
dairy ferments. Through 
experimentation and by trial 
and error, Carasso mixed fresh 
milk with ferments isolated at 
the Institut Pasteur to create in 
1919, exactly 100 years ago, what 
became the first Danone yogurt. 
He named the yogurt after his 
son, Daniel, and in reference to 
his nickname ‘Little Daniel’. The 
yogurt was packaged in porcelain 
pots (Fig. 1), which Carasso 
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hand-delivered as a health food 

to pharmacies across the city. 

In 1924, the production site was 

visited by the College of Doctors 
of Barcelona who complimented 
the product?. 

From pharmacies, yogurt 
made it through to other 
channels accessible to a range 
of consumers and became 
a mainstream product. One 
decade later, Daniel joined the 
family business and successfully 
expanded Danone across France. 
In the late 1930s, with Europe 
once again on the brink of war, 
he decided to move to the United 
States. In the years following 
the first world war and, later, 
the second world war, yogurt 
was consumed as a way of 
providing healthy nutrients. The 
fermentation process means that 
yogurt contains higher quantities 
of vitamins B2, B6 and B12, and 
the nutrients calcium, potassium, 
zinc and magnesium than other 
dairy products such as milk®. 
Yogurt promotes adherence 
to dietary guidelines for many 
nutrients, particularly those of 
concern, such as calcium. During 
fermentation, the breakdown of 
lactose by enzymes present in 
L. bulgaricus and S. thermophilus 
reduces lactose levels and 
facilitates the consumption of 
dairy products by individuals 
who might be sensitive to 
lactose levels found in milk’. 
Recent population studies have 
suggested that yogurt might not 
only be a significant contributor 
in the provision of micronutrients 
but is also associated with a 
reduced risk of weight gain and 
increased satiety and that yogurt 
consumption might be linked to 
an overall healthier dietary and 
lifestyle pattern®. 

Building on Pasteur 's heritage, 
scientists in the twentieth 
century isolated other health- 
promoting strains from the 
human intestine among which 
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were Escherichia coli Nissle 
(1917), Lactobacillus acidophilus 
LB (1907) and Lactobacillus casei 
Shirota (1935). In 1974, Richard 
B. Parker proposed a definition 
of probiotics as “organisms and 
substances which contribute to 
intestinal microbial balance’. 
An official definition was 

issued in 2001 by the Food 

and Agricultural Organization 
of the United Nations and the 
World Health Organization as 
constituting “live microorganisms 
that, when administered in 
adequate amounts, confer a 
health benefit on the host”. 
Different types of bacteria 

(for example Lactobacillus, 
Bifidobacterium, Streptococcus, 
Bacillus) and yeast or mould 

(for example Saccharomyces, 
Aspergillus, Candida) are qualified 
as probiotics. 

The first foodstuff to contain 
probiotics was probably 
fermented milk, but today many 
other matrices can contain 
probiotics. Danone started the 
production of two probiotic- 
containing fermented milks with 
Bifidobacterium lactis CNCM 
|-2494 (BFMP) and L. casei 
CNCM I-1518 (LFMP) in 1987 
and 1994, respectively named 
Activia® and Actimel®. With the 
ambition to build and substantiate 
the health benefits of probiotics, 
avast programme of studies was 
launched in the 1990s, resulting 
in more than 40 peer-reviewed 
scientific publications. The 
studied benefits of probiotic 
products range from reducing the 
incidence or duration of digestive 
discomfort symptoms, intestinal 
transit, common infection and 
illness, respiratory infection, 
gastrointestinal infection and 
prevention of diarrhoea following 
the use of antibiotics. Specifically, 
the results accumulated for 


BFMP showed a consistent 
and significant improvement 
of outcomes related to 


gastrointestinal discomfort in 
healthy adults'°" and a decrease 
in the severity of symptoms in 
people with IBS-C”. 


THE SECOND 
MICROBIOLOGY 
REVOLUTION FED DANONE’S 
GUT HEALTH RESEARCH 
Scientific discoveries often 

rely on the development of 

new technologies. The birth 

of modern microbiology at 

the end of the nineteenth and 
beginning of the twentieth 
century makes no exception. 

The first microbiology revolution 
was possible because of the 
development of new technological 
tools, such as the microscope. 
The advancements in genomics, 
proteomics and epigenomics 
have propelled the field into a 
second revolution that will allow 
researchers to understand the 
precise mechanisms of action 

of the bacteria in the gut. The 
metagenomics sequencing tools 
available today represent similar 
technological advancements 
because they allow scientists to 
rapidly capture the complexity of 
microbiomes and study invisible 
microorganisms without the need 
to isolate them. 

At the forefront of this 
second revolution were the 
European and United States 
consortia of laboratories, 
MetaHIT (Metagenomics of 
the Human intestinal Tract, a 
project financed by the European 
Commission, 2008-2012) and 
Human Microbiome Project 
(2007-2013), respectively, 
which have led the way and 
developed bioinformatics tools 
and concepts that have unlocked 
microbiome research. Inspired by 
its microbiology heritage, Danone 
Nutricia Research contributed to 
this frantic endeavour by joining 
the MetaHIT consortium, which 
was seen by many as a scientific 
gamble at the time. 
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Building on the emerging gut- 
centred science, Danone Nutricia 
Research rapidly focused its 
research efforts on understanding 
the mode of actions of its major 
probiotic-containing product 
targeting gut health: BFMP. 

After a decade of a broad range 
of collaborations with leading 
academics including those from 
Washington University at St. 
Louis, U.S., Harvard T.H. Chan 
School of Public Health in Boston, 
U.S., the University of California, 
Los Angeles, U.S. (UCLA), 
Institut National de la Recherche 
Agronomique, headquartered 

in Paris, and the University of 
Gothenburg, Sweden, we are 

able to provide a working model 
capturing the different potential 
mode of actions of BFMP" 

(Fig. 2). The evidence supports 
the hypothesis that BFMP confers 
health benefits by modulating the 
activity of the gut microbiome and 
the components of the gut-brain 
axis. On the gut microbiome, 

we have evidence showing that 
BFMP stimulates the pathways 
leading to the production of 
beneficial short-chain fatty acids 
and decreases the production of 
intestinal gas®"*, On the gut-brain 
axis, results obtained in different 
studies support the idea that this 
product decreases the visceral 
hypersensitivity and modulates 
activity of the brain’". 

During our research we 
found that not all individuals 
respond to BFMP to the same 
extent. Recently, we have 
shown that the decrease in 
intestinal gas production by 
BFMP was observed only 
in individuals exhaling high 
quantities of hydrogen’*. We 
also observed evidence for 
two types of microbiomes: 
permissive and resistant'®. 

The permissive microbiome 
was more susceptible to be 
modulated by BFMP than the 
resistant microbiome. Together, 
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Figure 2. The potential mode of action of BEMP (Activia® ). Source: Danone / Boris Le Nevé. 


the results open the avenue for 
the ‘precision probiotics’ era. In 
other words, probiotics that will 
be selected and tailored to an 
individual's phenotype, genetics 
and microbiome. 

Future investigations should 
focus on the mechanistic 
understanding of the benefits of 
probiotics. Such understanding 
will not just impact gut health 
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but many aspects of health, from 
the neurological to the immune 
system, from paediatrics to 
geriatrics. 


100 YEARS FROM NOW: 
CHALLENGES AND HOPES 

In today's world, just as Isaac 
Carasso found 100 years ago, 

we face many challenges. Ina 
world in constant and accelerated 
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change and continuous 
interconnections, human health is 
interconnected through effects of 
the environment, especially diet, 
the microbiome and the intrinsic 
and specific background of each 
individual. 


HEALTH THROUGH THE GUT 
For decades, the scientific 
approach to health was 
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compartmentalized and focused 
on studying specific organs or cell 
types. Today, we are witnessing 
the emergence of a new era with 
the rise of integrative science, 
which defines health as the result 
of the adequate functioning of 
interconnected organs, organ 
systems or cell types. The 
intestine has been highlighted as 
one of the most interconnected 
organs because it is involved 

in mental, cardiovascular, 
metabolic, immune and 
musculoskeletal health and is 
essential for health and plays an 
important part in disease”. 

Scientists are (re)discovering 
the gut as a cornerstone of human 
health and well-being because 
the function of the gut goes way 
beyond its primary function of 
digesting and absorbing nutrients. 
The gut is mediated by four 
different mechanisms: i) signalling 
molecules produced by the gut 
make it a virtual endocrine organ, 
arguably the largest that can 
contribute to circulating signalling 
molecules within the host; ii) 
bacteria-derived compounds (for 
example, short-chain fatty acids, 
lipopolysaccharides, vitamins, 
and polyphenol metabolites) 
able to cross the epithelial barrier 
contribute or interfere with the 
functioning of extra-intestinal 
organs or tissues; iii) immune 
cells primed in the intestine and 
homing in extra-cellular tissues 
and; iv) a nervous system able 
to transmit signals from the gut 
into the brain, interfering with its 
activity. 

Because the gut is important to 
human health, it is not surprising 
that more studies suggest the 
potential benefits of probiotics 
beyond the gut to include the 
brain, cardiovascular, metabolic, 
bone, muscular and immune 
systems. In 2013, Danone Nutricia 
Research and collaborators 
from UCLA" published the first 
gut-brain probiotic clinical trial 
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in collaboration with scientists 
showing that a four-week intake of 
BFMP by healthy women affected 
the activity of brain regions that 
control central processing of 
emotion and sensation. More 
recently, we published a meta- 
analysis of 105 publications 

and demonstrated that the 
consumption of probiotics is 
associated with improvements 

in several metabolic risk factors 

in subjects with metabolic 
diseases”. Probiotics seem to be 
relevant candidates to bring health 
through the gut. 

In line with our findings, we 
firmly believe in the concept of 
‘Health through the gut’, which 
places the gut and its microbiome 
as an important target for health. 
More than one hundred years 
after Metchnikoff's insights, 
the idea that small molecules 
produced by the gut microbiome 
can modulate health is highly 
relevant and has catalysed a new 
gold rush in the gut microbiome 
field. The focus on health through 
the gut will allow the development 
of new solutions, including 
probiotics, to improve personal 
and hopefully contribute towards 
solving public health challenges. 


PRECISION PROBIOTICS 
Scientific and technological 
advances in the gut microbiome 
field hold promises for a new 
way to select the next generation 
of probiotics, named hereafter 
‘precision probiotics’. Precision 
probiotics will bring to the gut 
activities or functions that are not 
provided by our gut microbiome 
or our own genes. Precision 
probiotics will be tailored to 
a recipient's diet, phenotype, 
lifestyle, age, gender, genetics 
and microbiome. 

The first precision probiotics 
were L. bulgaricus and 
S. thermophilus, the bacteria 
used for yogurt fermentation, 
because they allow people who 
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are lactose intolerant to consume 
yogurt by providing the lactase 
(an enzyme missing in people 
who are lactose intolerant) 
to convert lactose into well 
tolerated glucose and galactose. 
In the future, key species or 
functions of the gut microbiome 
will be identified by the scientific 
community, and probiotics will 
be a natural way to restore or 
support some of these functions. 
Probiotics could originate 
from our traditional lactic acid 
bacteria or Bifidobacterium 
but they will also come from 
bacteria isolated from the gut. 
Akkermansia municiphila is a 
gut-isolated bacterium that holds 
promises in the area of metabolic 
health because research suggests 
it improves several metabolic 
parameters in obese and 
overweight individuals’. 


OPEN SCIENCE 
There is huge potential 
for probiotics and the gut 
microbiome to contribute to 
public health, therefore one 
company and one research 
team will not suffice to explore 
and exploit all possibilties. A 
new way of thinking is required, 
and that involves multiple 
partners working together. The 
Danone vision is collaboration 
and openness that allows 
complementary factors to work 
together towards common goals. 
To celebrate 100 years since 
the creation of its first yogurt, 
Danone has opened its collection 
of 1,800 strains for research 
purposes”°. The 193 lactic and 
Bifidobacterium ferment strains, 
as well as Danone's collection of 
1,600 other strains, have been 
made available to researchers 
around the world, with the sole 
aim of sharing our legacy to 
benefit all. The bacteria strains in 
Danone’s collection may have a 
range of additional uses, many of 
which have not yet been explored 


to their fullest potential. The 
strains could help to address 
aseries of health, societal and 
environmental challenges”°. 

Additionally, and due to the 
complexity and diversity of the 
microbiome, Danone Nutricia 
Research decided to collaborate 
with world citizens and open 
the possibility for citizens to 
help understand the complexity 
and diversity of the world’s gut 
microbiomes. Traditionally, 
research on the gut microbiome 
focused on populations from 
the Western world because we 
know the diet and how a variety 
of foods can modulate the gut 
microbiome. Danone Nutricia 
Research partnered with the 
University of California, San 
Diego, U.S. to understand the 
best ways of nourishing the gut 
microbiome. By launching The 
Human Diets & Microbiome 
Initiative (THDMI) we aim to 
discover the best diets and foods 
consumed by humans in different 
parts of the world that can benefit 
our guts. Because of the high 
variability in gut microbiome 
composition between individuals, 
it is necessary to achieve the 
recruitment of several cohorts 
across all geographical regions 
to enable a full mapping of the 
gut microbiome combined with 
dietary data. Most importantly, 
unlike other similar initiatives, 
this programme is using full 
metagenome sequencing, 
which is the latest sequencing 
technology, to assess the gut 
microbiome composition and 
function. 

The THDMI programme 
has a public health educational 
vocation and will be conducted 
to contribute to promote healthy 
dietary habits. The THDMI 
initiative will give participants the 
opportunity to become citizen 
scientists and to contribute 
to the ‘health through the gut’ 
revolution. 
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BRINGING IT ALL TOGETHER: 
THE GUT MICROBIOME AND 
DANONE SHARED HISTORY 
The entrepreneurial spirit that 
inspired Isaac Carasso to search 
for solutions for malnourished 
children is as essential today as 
it was 100 years ago. And it is 
this spirit that propels us into the 
future at Danone. Entrepreneurs, 
in the purest sense, are those 
who identify a need - any need 

- and fill it. Every entrepreneur 

is a problem solver, someone 
who sees a problem affecting 
people and is driven to finda 
solution. Inspired by Carasso’s 
entrepreneurship, openness 

and curiosity, research and 
innovation teams at Danone 


Nutricia Research share the 
same passion for innovation. 
Danone researchers use their 
scientific and technological 
expertise to create products 
that best fit the Danone mission 
to bring health through food 

to as many people as possible. 
The science and technology for 
developing probiotics has moved 
at the same pace as the science 
in the field of the microbiome. 
The gut microbiome history 
reflects the Danone history for 
food and health. 
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